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Abstract 

The application of automatic transformation processes during the formal development and 
optimization of programs can introduce encumbrances in the generated code that program- 
mers usually (or presumably) do not write. An example is the introduction of redundant 
arguments in the functions defined in the program. Redundancy of a parameter means 
that replacing it by any expression does not change the result. In this work, we provide 
methods for the analysis and elimination of redundant arguments in term rewriting sys- 
tems as a model for the programs that can be written in more sophisticated languages. On 
the basis of the uselessness of redundant arguments, we also propose an erasure procedure 
which may avoid wasteful computations while still preserving the semantics (under ascer- 
tained conditions). A prototype implementation of these methods has been undertaken, 
which demonstrates the practicality of our approach. 

KEYWORDS: redundant arguments in functions, semantics-preserving program transfor- 
mation, analysis and optimization, term rewriting 



1 Introduction 

A number of researchers have noticed that certain processes of optimization, trans- 
formation, specialization and reuse of code often introduce anomalies in the gen- 
erated code that programmers usually (or ideally) do not write IjAho et al. 19861 
IHughes 19881 ILeuschel and S0 rensen 19961 ILiu and Stoller 2f)"02)l . Examples are re- 
dundant arguments in the functions defined by the program, as well as useless 
program rules. The notion of redundant argument means that replacing it by what- 
ever expression we like, the final result does not change; independently of actual 
computations. The following example motivates our ideas. 

Example 1 

Consider the following program that calculates the concatenation of two lists of 
natural numbers and the last element of a list, respectively: 

append (nil, y) = y last (x: nil) = x 

append (x:xs,y) = x : append (xs ,y) last(x:y:ys) = last(y:ys) 

Assume that we specialize this program for the call applast(ys,z) = 
last (append (ys,z: nil)), which appends an element z at the end of a given 
list ys and then returns the last element, z, of the resulting list; the example is 
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borrowed from DPPD library of benchmarks l|Leuschel f998jl and was also consid- 



ered in Leuschel and Martens (1995| ) and Pettorossi and Proietti (1996b I for logic 



program specialization. Commonly, the optimized program which can be obtained 
by using an automatic specializer of functional programs ( |Alpuente et al. 1997| 
|Alpuente et al. 1998| |A"lpuente et al. 1999| ) is: 

applast(nil,z) = z lastnew(x,nil,z) = z 

applast (x : xs , z) = lastnew(x,xs,z) lastnew(x,y:ys,z) = lastnew(y,ys,z) 

The first argument of the function applast is redundant (as well as the first 
and second arguments of the auxiliary function lastnew) and would not typi- 
cally be written by a programmer who writes this program by hand. This pro- 
gram is far from {applast' (ys,z) = lastnew' (z), lastnew' (z) = z}, a more fea- 
sible one with the same evaluation semantics, or even the "optimal" program — 
without redundant parameters — { applast" (z) = z} which one would ideally ex- 
pect (here the rule for the "local" function lastnew' is disregarded, since, after 
optimizing the definition of applast', it is not useful anymore). Note that stan- 
dard (post-specialization) renaming/compression procedures ( jAlpuente et al. 1997| 
|Gallagher 1993|[Gliick and S0rensen 1994b cannot perform this optimization as they 
only improve programs where program calls contain dead functors or multiple oc- 
currences of the same variable, or the functions are defined by rules whose rhs's are 
normalizable. 



Therefore, it seems interesting to formalize program analysis techniques for de- 
tecting these kinds of redundancies as well as to formalize transformations for elim- 
inating the dead code that appears in the form of redundant function arguments 
or useless rules and which, in some cases, can be safely erased without jeopardizing 
correctness. 

In this work, we investigate the problem of redundant arguments in Term Rewrit- 
ing Systems (TRSs), as a model for the programs that can be written in more sophis- 
ticated equational, functional, or functional-logic languages. We provide a semantic 
characterization of redundancy which is parametric w.r.t. the observed semantics 
S. After some preliminaries in Section El in Section |31 we consider different (re- 
duction) semantics S, including the standard normalization semantics (typical of 
pure rewriting) and the evaluation semantics (closer to functional and equational 
programming). In Section0]we introduce the notion of redundancy of an argument 
w.r.t. a semantics S and provide some useful properties. In Section0we derive a de- 
cidability result for the redundancy problem w.r.t. S and provide the first effective 
method for detecting redundancies, which is based on approximation techniques. 
Then, in Section we provide a more practical method to recognize redundancy 
which allows us to simplify the general redundancy problem to the analysis of the 
rhs's of the program rules. 

At first sight, one could naively think that redundant arguments are a straight 
counterpart of "needed redex" positions ; Unci and Levy 1991| ), a well-known oper- 
ational notion in term rewriting, which could be easily neutralized by appropriately 
driving the computation. Unfortunately, this is not true as illustrated by the fol- 
lowing example. 
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Example 2 

Consider the optimized program of Example ^ extended with: 

take(0,xs) = nil 

take(s(n) ,x:xs) = x:take(n,xs) 

The contraction of redex take (1 , 1 : 2 :nil) at position 1 in the term 1 
t = applast (take (1 , 1 : 2 :nil) ,0) is needed to normalize the term t to the con- 
structor normal form 0. This means that such redex position (or one of its resid- 
uals) must be reduced in each rewriting sequence from t to its normal form (see 
and Levy 1991| )). However, the first argument of applast is redundant for 
normalization, as we showed in Example Q an d the program could be improved by 
dropping this useless parameter. Therefore, although needed redexes are an essen- 
tial piece of the computational process which implements the evaluation, from a 
semantic point of view, they can be irrelevant (redundant). 

Since needed redexes must all be reduced in any reduction sequence leading to a 
normal form, Example shows that no normalizing reduction strategy is able to 
dodge the problem by avoiding the exploration of the redundant argument. Thus, 
in general, inefficiencies caused by the redundancy of arguments cannot be avoided 
by using rewriting strategies. Therefore, in Section we formalize an elimination 
procedure which gets rid of the redundant arguments and provide sufficient condi- 
tions for the preservation of the semantics. Preliminary experiments in Section |H1 
indicate that our approach is both practical and useful. 

An extensive comparison with the related literature is provided in Sectional We 
summarize some relevant ideas as follows. Strictness analysis 2 i|Burn et al. 19861 
Burn 199lll,Iensen 199l"l|Mycroft 1980||Mycroft and Norman 1992|ISekar et al. 19901 
|Wadler and Hughes 19871 ) can be used to determine whether the evaluation of an 
argument within an expression e = f(e%, . . . , e$, . . . , e*) is "strictly" necessary to 
obtain the value of e. The counterpart of this notion has been studied in a number of 
different analysis techniques such as dead code analysis <|Liu and Stoller 2002(1 . un- 
neededness analysis (Hugh e~1988| ), absence analysis (|Cousot and Cousot 1994J) . Rl- 
tering analysis (Leuschel and S0rens en 19961 ), or useless analysis I) Wand and Siveroni 1 999). 
Also, similar techniques to detect and remove parts of a program which are com- 
putationally irrelevant have been investigated in the past: program specialization 
dAlpuente et al. 1997||Alpuente et al. 1998||Alpuente et al. 1999|ILeuschel and Martens 19951 
IPettorossi and Proietti 1994llPettorossi and Proietti 1996a|) . slicing IjGouranton 1998 
ISchoenig and Ducasse 1996||Reps and Turnidge 1996||gaTagyi et al. 2002||Tip 1995| 
IWeiser 19 84). compile-time garbage collection ( |jones and Metayer 1989||Park and Goldberg 1992| 
Kno op et al. 1994] ), and dead code removal IjBerardi et al. 20001 |Kobayashi 2000| 
ILiu and Stoller 2002|> . 

1 In this paper, naturals 1,2,... are often used as shorthand to numbers s n (0) where n = 1, 2, 

2 Roughly speaking, a function symbol / is strict in its i-th argument if any subterm at such 
argument position must be completely evaluated during the evaluation of /. In symbols: let 
Di, . . . , Dfc, D be ordered sets with least elements J_i, . . . , _L^, _L respectively, expressing un- 
definedness, a mapping / : Di X ■ ■ • X D k — ► D is said to be strict in its i-th argument if 
f(di,. . . ,±i, . . . ,dk) = -L for all di e D\, . . . , d k 6 D k . 
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In Section 1101 we briefly discuss the detection of redundant arguments in func- 
tional logic programs mechanized by narrowing. We conclude in Section HP Proofs 
of all technical results are given in |Appendix A| 

This paper is a revised and improved version of flAlpuente et al. 2002b| ). 



2 Preliminaries 

Term rewriting systems provide an adequate computational model for functional 
and equational programming languages which allow the definition of functions 
by means of patterns, e.g., Haskell, Hope, or Miranda (Baade ~and Nipkow 1998| 
|Klop 1992) |Plasmeijer and van Eekelen 1 993). In the remainder of the paper we 
follow the standard framework of term rewriting for developing our results; see 
( |Baader and Nipkow 1998| ITeReSe 2 003 ) for missing definitions. In order to sim- 
plify our presentation, definitions are given in the one-sorted case; the extension 
to many-sorted signatures is not difficult l|Padawitz 1988|) . and we comment where 
they matter the non-obvious details. 

Let — > C A x A be a binary relation on a set A. We denote the inverse of — > by 
<— , the symmetric closure by the transitive closure by -^ + , the reflexive and 
transitive closure by — >*, and the reflexive, symmetric and transitive closure by <->•*. 
We say that — * is confluent if, for every a, 6, c € A, whenever a — >* b and a — >* c, 
there exists d £ A such that b — >* d and c — >■* d. We say that — > is terminating (or 
well-founded) iff there is no infinite sequence a\ — > — * 03 

Throughout the paper, X denotes a countable set of variables {x,y, w, . . .}, and 
J- denotes a finite set of function symbols {f , g,h, . . .}, each one having a fixed arity 
given by a function ar : T — > N. By X) we denote the set of terms and by 

T(J-) the set of ground terms, i.e., terms without variable occurrences. Var(t) is 
the set of variables in t. A term is said to be linear if it has no multiple occurrences 
of a single variable. A fc-tuple ti,. . . ,tk of terms is written t. The number fc of 
elements of the tuple t will be clarified by the context. 

A substitution is a mapping a : X — * T(J-, X) which homomorphically extends 
to a mapping a : T{T , X) — > T(J r , X). The substitution a is usually different from 
the identity, i.e., Mx S X : id(x) — x, for a finite subset T>om{a) C X, called the 
domain of a. By 6 o a we denote the composition of the substitutions a and 9, i.e., 
0o<r(a;) = 8(a(x)). Let Subst(J-, X) denote the set of substitutions and Subst(!F) be 
the set of ground substitutions, i.e., substitutions on T(J-). If cr(t) is a ground term, 
we call a a grounding substitution for £. A unifier of two terms i, s is a substitution 
er such that o~(t) = cr(s) and cr is idempotent, i.e., a o a = a. A most general unifier 
(mgu) of t, s is a unifier cr such that for each unifier a' of £, s there exists such 
that a' = 9 o a. By er|y we denote the restriction of subsitution a to the variables 
in V. 

Terms are viewed as labelled trees in the usual way. Positions p, q, . . . are defined 
as sequences of positive natural numbers used to address subterms of t, with A the 
root position (i.e., the empty sequence), p.q the position concatenation, and p < q 
the usual prefix ordering. Two positions p, q are disjoint, denoted by p || q, if neither 
p < q, p > q, nor p = q. The symbol labeling the root position of t is denoted as 
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root(t). The subterm at position p of t is denoted as t\ p and t[s] p is the term t 
with the subterm at position p replaced by s. The restriction of a set of positions 
P w.r.t. a position p is defined as P\ p — {p' | 3g G P A g = p.p'}, the concatenation 
of a position p and a set of positions P is defined as p.P = {p.q \ q G P}, and 
the comparison of a set of positions P w.r.t. a position p is defined as p < P iff 
p < q for each g G P. By Voss{t) we denote all positions in t with a symbol or 
variable from S C J 7 ij X. We use Pos/(t) and Vos{t) as shorthands for 7 ? 0S{y:}(t) 
and Vosyr u x(t), respectively. A context is a term C with zero or more 'holes', i.e., 
the fresh constant symbol □. We usually write simply C[ ] to denote an arbitrary 
context, clarifying the number and location of holes 'in situ'. If C is a context and 
t a term, C[t] denotes the result of replacing the hole in C by t. 

A rewrite rule is an ordered pair (I, r), written 3 / — ► r, with I, r G T{T, X), I £ X 
and Var(r) C Var(Z). The left-hand side (Ihs) of the rule is I and r is the right-hand 
side (rhs). A TRS is a pair TZ = (J 7 , R) where R is a set of rewrite rules and T is 
called the signature. A term t rewrites to s (at position p), written t -^k s (or just 
t — ► s), if t| p = er(Z) and s = t[er(r)] p , for some rule I — > r G _R, p G Vos(t) and 
substitution <r. An instance <r(i) of the Z/is of a rule I — > r is called a redex; similarly 
subterm f | p in a rewrite step is also called a redex. A term t without redexes is said 
a normal form. By we denote the set of finite normal forms w.r.t. 1Z. A term 
t is said a head-normal form (or root-stable) if it cannot be rewritten to a redex. 
By HNF75. we denote the set of head-normal forms w.r.t. 7?.. 

A TRS 1Z is left linear if all its lhs's are linear terms. A TRS 1Z is ground (resp. 
right-ground) if all its lhs's and rhs's (resp. only its rhs's) are ground terms. A 
TRS 1Z is terminating (resp. confluent) if the relation -^-jz is terminating (resp. 
confluent). Two terms t, s are joinable, denoted by t I s, if there exists a term u 
such that t u and s u. 

Given 1Z = (J 7 , R), we assume T can be always considered as the disjoint union 
T = C tt) T> of symbols c G C, called constructors, and symbols / G T>, called defined 
functions, where V = {/ | /(I) ->• r G i?} and C = T - P. Then, T(C,X) is the 
set of constructor terms. A pattern is a term f{l\, . .. ,l n ) such that / G T> and 
l\,...,l n G T(C, <Y). A constructor system (CS) is a TRS whose lhs's are patterns. 

Two (possibly renamed) rules I — > r and ?' — ► r' overlap, if there is a non- variable 
position p G Vosjr[l) and a most-general unifier er such that c(l| p ) = ct(Z'). The pair 
(<T(Z)[(T(r')]p, cr(r)) is called a critical pair and is also called an overlay if p = A. A 
critical pair (t, s) is trivial if t = s. A left-linear TRS without critical pairs is called 
orthogonal. Note that orthogonality of a TRS 1Z implies confluence of — ►•r.. A left- 
linear TRS where its critical pairs are trivial overlays is called almost orthogonal. 

3 Semantics 

The redundancy of an argument of a function / in a TRS TZ depends on the seman- 
tics properties of TZ that we are interested in observing. Our notion of semantics is 
aimed to couch operational as well as denotational aspects. 

3 We will use also I = r to differentiate a rule from a rewriting step. 
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A term semantics for a signature J 7 is a mapping S : T(J-) — > V{T{J-)) IjLucas 2001|l 
which associates a set of terms to a term. A rewriting semantics for a TRS 7?. = 
(J 7 , R) is a term semantics S for T such that, for all t € T(^F) and s € S(t), i — ^ s, 
i.e., a term semantics where the set of terms associated to a term is determined 
only by the program. 

The rewriting semantics which is most commonly considered in functional pro- 
gramming is the set of values (ground constructor terms) that TZ is able to produce 
in a finite number of rewriting steps (evalfj(t) = {s £ T{C) \ t — >^ s}). Other 
kinds of rewriting semantics often considered for TZ are, e.g., the set of all pos- 
sible reducts of a term which are reached in a finite number of steps (redn(t) = 
{s 6 | t — >^ s}), the set of such reducts that are ground head-normal forms 

(hnf = red-ft(t) n HNF-r,), or ground normal forms (nf^i) = hnf ??.(£) n NF^). 
We also consider the (trivial) semantics empty which assigns an empty set to ev- 
ery term. We often omit TZ in the notations for rewriting semantics when it is 
clear from the context. Furthermore, a rewriting semantics S for a TRS TZ is called 
(1Z-) normalized if, for all t € T(F), S(t) C NF-jz, i.e., the semantics associates only 
normal forms to a term, eval and nf are examples of normalized semantics whereas 
hnf and red are not normalized. 

The ordering ^ between semantics l|Lucas 2001|l provides some interesting prop- 
erties regarding the redundancy of arguments. Given term semantics S and S' for 
a signature J 7 , we write S < S' if there exists T C T(T) (called window set of S' 
w.r.t. S) such that, for all t £ T(T), S(t) = S'(t) n T. Note that, then, we have 
empty -< eval-^ < nf-^ < hnf^ < red^. 

Given a rewriting semantics S, it is interesting to determine whether S provides 
non-trivial information for every input expression. Let 72. be a TRS and S be a 
rewriting semantics for 1Z, we say that 1Z is S-defined if for all t £ T(T\ S(t) ^ 
IILucas 2001(1 . S-definedness is monotone w.r.t. ;<: if S ^ S' and 1Z is S-defincd, 
TZ is also S'-defined. 

S-definedness has already been studied in the literature for different seman- 
tics ([Lucas 200lJ. In concrete, nf-defined TRSs are known as normalizing TRSs (i.e., 
every term has a normal form ( |Baader and Nipkow 1998| |) and eval-definedness is 
related to termination and the standard notion of completely defined (CD) TRSs; 
see ( |Kapur et al. 1987|lKounalis 1985J) . Roughly speaking, a defined function sym- 
bol is completely defined if it does not occur in any ground term in normal form, 
that is to say that functions are reducible on all ground terms (of appropriate 
sort). A TRS TZ is completely defined if each defined symbol of the signature is 
completely defined. In one-sorted theories, completely defined programs occur only 
rarely. However, they are common when using types, and each function is defined 
for all constructors of its argument types. 

Let TZ be a normalizing and completely defined TRS; then, TZ is eval-ft-dcfined. 
Being completely defined is sensitive to extra constant symbols in the signature, 
and so is redundancy. Thus, we are not concerned with modularity in this work. 

From now on, we formulate the notion of a redundant argument and provide 
some useful properties and detection techniques. 
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4 Redundant Arguments 

Roughly speaking, a redundant argument of a function / is an argument tj which 
we do not need to consider in order to compute the semantics of any call containing 
a subterm f(t\, . . . , ifc). 

Definition 1 (Redundancy of an argument) 

Let S be a term semantics for a signature f £ J 7 , and i G {1, . . . , ar(/)}. The i-th 
argument of / is redundant w.r.t. S if, for all contexts C[ ] and for all t, s € T(J-) 
such that root(t) = f, S(C[t]) = S(C[t[s]<]). 

We denote by rarg s (f) the set of redundant arguments of a symbol / G JF w.r.t. 
a semantics S for T . Note that every argument of every symbol is redundant w.r.t. 
empty. The following result shows that redundancy is antimonotone with regard to 
the ordering < on semantics. 

Theorem 1 (Antimonotonicity of redundancy) 

Let S, S' be term semantics for a signature T . If S ~< S', then, for all / G J 7 , 
rarg s ,(f) C rarg s (f). 

The following result guarantees that constructor symbols have no redundant argu- 
ments for usual non-trivial semantics, which agrees with the common understanding 
of constructor terms as completely meaningful pieces of information. 

Proposition 1 (Non-redundancy of constructors) 

Let 72. be a TRS such that |T(C)| > 1, and consider a rewriting semantics S such 
that eval-R, < S. Then, for all c G C, rarg s (c) = 0. 

For many-sorted signatures, we would require that \T(C) T \ > 1 for the sort r of an 
argument of a constructor symbol c. In the following section, we consider several 
aspects about decidability of the redundancy of an argument. 

5 Decidability Issues 

In general, the redundancy of an argument is undecidable. However, we are able to 
provide a decidability result about redundancy w.r.t. all the non-trivial semantics 
considered in this paper. In this section, for a signature J 7 , term semantics S for J 7 , 
f G J 7 , and i G {1, . . . , ar(f)}, by "redundancy w.r.t. S" we mean the redundancy 
of the i-th argument of / w.r.t. S. 

We follow the "(W)SfcS approach" to decide a given property P, which is based 
on ascertaining the conditions for expressing P in a decidable logic, namely the 
(weak) second-order monadic logic with k successors (W)SfcS; see (|Thomas 1990(1 . 
The following theorem by Rabin is the key element for our results in this section. 

Theorem 2 \R.abin 196^ 

The (weak) monadic second-order theory of k successor functions (W)SkS is decid- 
able. 
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First, we recall some basic definitions about the WSfcS logic; see e.g., IjThomas 3993. 
Terms of the WSfcS logic are formed out of individual variables x,y, z, . . ., the empty 
string A, and right concatenation with 1, . . . , k. Atomic formulas are equations be- 
tween terms, inequations w < w' between terms, or expressions w £ X where w is 
a term and X is a (second-order) variable. Formulas are built from atomic formulas 
using the logical connectives A, V, -1, ... and the quantifiers 3,V of both indi- 
vidual and second-order variables. Individual variables are interpreted as elements 
of {1, ... , fc}* and second-order variables as finite subsets of {1, . . . , k}*. Equality 
is the string equality and inequality is the strict prefix ordering. Finite union and 
intersection, as well as inclusion and equality of sets, are definable in WSfcS in an 
obvious way. 

Let us relate TRSs and WSfcS logic. Given a finite signature J 7 , let k be the 
maximal arity of all the function symbols in T and n be the cardinality of T . A 
term t is represented in WSfcS using n + 1 set variables X and Xf,f £ J 7 , which are 
denoted by X in the following. X will be the set of all positions of t, and Xf will be 
the set of positions that are labeled with the corresponding function symbol. The 
following WSfcS formula expresses that X encodes a term in T{T) IjUomon 20001 
|Durand and Middeldorp 1997| ): 

Term AX) = f X = U?=i *f t A A i? y H X fj = 0) 
A Vx £ X Vy < x (y £ X) 

A A /e ^(Vx £ X f : l\Z ( P{x-l eX)A Af =ctr(/)+1 (aU £ *)) 

If Termjr{T) holds, then we let tf define the term in T{T) which is uniquely 
determined by Vos(t) = T and root{t\ p ) = f if p £ Tf for all p £ T. A subset of 
ground terms L C T(JF) is called WSfcS definable if there exists a WSfcS formula $ 
with free variables f such that L = {tf\ Term T (J) A $(T)}. 

An arbitrary term semantics S can be encoded as a relation S between terms: 
S = {{t,s) I t £ T(T) A s £ S(t)}. Hence, we say that semantics S is WSfcS 
definable if there exists a WSfcS formula $ with free variables T and S such that 
(t f , sg) £ 5 ^ Term r {f) A Termjr(S) A $(f , 5). 

Theorem 3 {Decidability of redundancy) 

Let S be a term semantics for a signature T. If S is WSfcS definable, then redundancy 
w.r.t. S is decidable. 

The following result shows that decidability of redundancy is antimonotone with 
regard to the ordering < on semantics. 

Proposition 2 

Let S, S' be term semantics for a signature T . If S < S', S' is WSfcS definable, and 
there exists a window set T C T(T) of S' w.r.t. S which is WSfcS definable, then S 
is WSfcS definable. 

In l|Dauchet et, al. 19901 IDauchet et al. 1987jl . ground (finite) tree transducers 
(GTT for short) were introduced to recognize the rewrite relation in (left- 
linear and right-)ground TRSs. Since GTT-recognizable relations are definable in 
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WSfcS IjComon 2000(1 . the semantics red is also WSfcS definable, hence the redun- 
dancy w.r.t. red is decidable. Now, the following result shows that the window set 
HNF-r, is WSfcS definable; this is useful for proving that semantics hnf is also WSfcS 
definable. 

Theorem 4 

The set HNF K of a finite left-linear, right-ground TRS TZ is WSfcS definable. 

Then, the following theorem provides the first decidability result w.r.t. all the 
non-trivial semantics considered in this paper. 

Theorem 5 (Decidability for semantics red^., hnf^, nf-^, and eval-R.) 

For a left-linear, right-ground TRS TZ over a finite signature T, the redundancy 

w.r.t. semantics red^, hnf^, nf?j, and eval^j is decidable. 

This result recalls the decidability of other related properties of TRSs, such 
as confluence, joinability, and reachability problems (for left-linear, right-ground 
TRSs) IjDauchet et al. 19871 |Oyamaguchi 1990| ). For instance, the confluence prob- 
lem was shown to be undecidable for right-ground TRSs, while it is decidable for 
ground TRSs and also for left-linear and right-ground TRSs IIDauchet et al. 1 987). 
Note that we cannot weaken in our approach the requirement of right-groundness 
in TheoremElto the more general conditions of shallowness IjComon 2000JI or grow- 
ingness ( |Jacquemard 19961 ) as the induced rewrite relations are not expressible in 
the logic WSfcS that we use to decide the property ( |Durand and Middeldorp 1997| ). 

In the following section we provide the first redundancy detection method, which 
(sufficiently) ensures that an argument is redundant in a given TRS. 



5. 1 Approximations of Redundancy 

Whenever a property is undecidable or costly to decide, we use approximations. 
A notion of approximation (for TRSs) that has been proven useful for approxi- 
mating interesting properties in term rewriting (namely neededness of redexes for 
normalization) is the following ( |Durand and M iddeldo rp 1997| |Jacquemard 1 996): 
Given TRSs TZ and TZ' (possibly with extra variables) over the same signature, TZ' 
approximates TZ if — and NF-r. = NF^/. An approximation of TRSs is a 
mapping a from TRSs to TRSs with the property that TRS a(TZ) approximates 
TRS TZ flDurand and Middeldorp 1997| ). We write TZ a instead of a(TZ) to denote 
the approximation of TZ according to a. Strong, nv ( |Durand and Middeldorp 1997| ), 
shallow HComon 2000f) . and growing ( |Jacquemard 1996| ) are examples of such ap- 
proximations of TRSs. In all these approximations, the rhs's of the rules are mod- 
ified in different ways. For instance, given a TRS TZ, TZ nv is obtained by replacing 
all variables in the rhs by new, different variables that do not occur in the lhs; this 
is possible since the framework deals with extra variables. 

In order to approximate redundancy, we need to use a new symbol f2 to represent 
all ground terms (in particular, to be used at the argument position which is tested 
for redundancy). Inspired by ( |Durand and M iddcldo rp~l997| [Oyamaguchi 1986), 
we define our notion of approximation as follows. Let TZ be a TRS over a signature 
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T and TZ' be a TRS over the signature J-U {^}, where is a new constant symbol 
defined by the rules {f2 — > /(f2) | / G J 7 }. We extend the approximation notion 
of ( |Durand and Middeldorp 1997| |Jacquemard 1996] ) naturally to TRSs over signa- 
tures T and J- U {£1}, where fHs a special symbol that potentially expresses any 
term. Note that we consider the normalization semantics only for ground terms. 
Thus, we say that TZ' approximates TZ (but notice that, now, TZ' is a TRS on 
FU{il}) if n (T(J-) x T{T)) C— ^, n(T(f)xT(f)) and NF TC = NF W -. Note 
that, -^C (Tf^^xT^-Y)) whereas -^C (T(T U {O}, ^)xT(f U {O}, A?)); 
however, by definition of TZ' , NF-r/ C T(fF). 
The following notation is auxiliary. 

Definition 2 (S-determinacy w.r.t. f and i) 

Given a symbol / G T and an argument i G {1, . . . , ar(/)}, we say that the seman- 
tics S is determined w.r.t. / and i if for every context C[] and t G T{J-) such that 
root(t) — /, then |S(C[t[fi]j])| < 1; where |A| stands for the cardinality of the set 
A. 

The following theorem provides a sufficient condition for redundancy which is 
the basis of our decidable approximations of redundancy. 

Theorem 6 {Approximation of redundancy) 

Let TZ = (J 7 , R) be a TRS, TZ' be an approximation of TZ, f G !F, i G {1, . . . , ar(f)}, 
and S G {eval,nf}. If TZ is S^-defined and Sn> is determined w.r.t. / and i, then 
i G rarg SK (f). 

It is an open problem whether redundancy is decidable for terminating TRSs. 
Nevertheless, Theorem [H] ensures that redundancy w.r.t. nf is approximable for 
terminating TRSs, since any terminating TRS TZ is nf-R.-defined. The following 
theorem ensures that WSfcS definability of a semantics entails the possibility of 
guaranteeing decidability of a given approximation. 

Theorem 7 (Decidability of S-determinacy w.r.t. f and i) 

Let S be a term semantics for a signature T U {ft}. If S is WSfcS definable, then it 
is decidable whether S is determined w.r.t. / and i. 

Remember that the semantics eval-R. and nf-jz are WSfcS definable for left-linear, 
right ground TRSs over finite signatures. This suggests us to use the following 
approximation of left-linear right-ground TRSs. Given TZ = [fF, R), we define TZ rg = 
(T U {fi}, R r g) as follows: 

R rg ={l^r Q \l^reR}U{n^ f(Q) | / G J-} 

where in is the term t with all variables replaced by Q. It is straightforward to 
see that rg is an approximation of TRSs. The following theorem ensures that S- 
determinacy w.r.t. / and i is decidable for an approximation TZ rg of a TRS TZ and 
semantics S G {nfrc,. g , eval Krg }. 
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Theorem 8 

Let K be a left-linear TRS, TZ rg be the approximation rg of TZ, f G T, i G 
{1, . . . , ar(/)}, and S £ {eval-R, rg , nfjj }. It is decidable whether S is determined 
w.r.t. / and i. 

By Theorems El and |H1 redundancy of an argument w.r.t. nf^ (and eval-R.) is 
effectively approximable by using rg. 

Corollary 1 [Approximation of redundancy for TZ rg ) 

Let TZ = (T, R) be a left-linear TRS, / G T, i G {1, . . . , ar(/)}, and S G {eval, nf}. 
If TZ is S-fc-defmed and S-ji rg is determined w.r.t. / and i, then i G rarg^(f). 

Example 3 

Consider the left-linear TRS TZ 

f(x.O) = f(0,s(y)) = s(0) f(s(x),s(y)) = g(x,y) 

g(x,y) = f(x,s(y)) 

Note that TZ is terminating, hence nf^-defmed. Approximation TZ rg is: 

f(x,0) = f(0,s(y)) = s(0) f(s(x),s(y)) = gCfi.fi) 

g(x, y ) = f(o,s(0)) n = fm,m n = 

n = s(n) n = o 

It is not difficult to see that r\fn rg is determined w.r.t. f and 1 whereas is not 
determined w.r.t. f and 2. It is possible to construct an automaton which tests those 
conditions, see e.g., ( |Thatcher and Wright 19681 ) f° r more details, thus making it 
automatically provable. By Theorem this means that 1 G rarg n ^(t ). 

The approximation rg is similar to nv of ( |Durand and Middeldorp 1997| ), that re- 
places every variable in rhs's by fresh ones. However, including the new symbol 
in the rhs's of the approximated program is essential for our development since the 
semantics of the program obtained by the approximation nv is not expressible in 
the logic WSfcS. 

In the following section, we address the redundancy analysis from a complemen- 
tary perspective. Rather than going more deeply in the decidability issues, we are 
interested in ascertaining conditions which (sufficiently) ensure that an argument 
is redundant in a given TRS. In order to address this problem, we investigate re- 
dundancy of positions. 

6 Redundancy of positions 

When considering a particular (possibly non-ground) function call, we can observe 
a more general notion of redundancy which allows us to consider arbitrary (deeper) 
positions within the call. 

Definition 3 (p- prefix- equal terms) 

We say that two terms (,s£ X) are p- prefix- equal, with p G Vos(t) n Vos{s) 
if, for all occurrences w with w < p, t\ w and s\ w have the same symbol at the root. 
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Definition 4 (Redundant position) 

Let S be a term semantics for a signature T and t S T(J-,X). The position p 6 
Vos(t) is redundant in t w.r.t. S if, for all t',s £ T(J-) such that £ and i' are 
p-prefix-equal, S(t') — S(t'[s] p ). 

We denote by rpos$(t) the set of redundant positions of a term t w.r.t. a semantics 
S. 

Note that the previous definition cannot be simplified by getting rid of t' and 
simply requiring that for all s G T(J-), S(t) = S(i[s] p ), mimicking Definition^ The 
reason is that positions in a term cannot be analyzed independently for redundancy 
if we want our notion of redundancy of positions to be truly compositional, as the 
following example shows. 

Example 4 

Let us consider the TRS 1Z: 

f (a, a) = a f(a,b) = a f(b,a) = a f(b,b) = b 

Given the term t = f (a, a), for all terms s <E ^"(^ r ), eval^(t[s]i) = eval^t) and 
eval-R.(t[s] 2 ) = eva\-jz(t). However, eval-R.(t[b]i [b] 2 ) / eval^(i). Indeed, 
1,2 £ rpos ev3 ^{t). 

In the following, we extend Theorem^a-nd Proposition ^ (which concern redun- 
dant arguments of function symbols) to redundant positions of terms. 

Theorem 9 (Antimonotonicity of redundancy of a position) 

Let S, S' be term semantics for a signature T . If S -< S', then, for all t £ T(T , X), 
rpos s ,(t) C rpos s (t). 

Proposition 3 (Non-redundancy of constructor positions) 

Let 1Z be a TRS such that |T(C)| > 1, and S be a rewriting semantics such that 
eval TC ^ S. Then, for all t e T(C,X), rpos s (t) = 0. 

The following result states that the positions of a term which are below the 
indices addressing the redundant arguments of any function symbol occurring in t 
are redundant. 

Proposition 4 

Let S be a term semantics for a signature T,t£ T(T, X), p € Vos(t), /eD. For all 
positions q,p' and i G rarg s (f) such that p = q.i.p' and root(t\ q ) = /, p £ rpos 5 (t) 
holds. 

In the following, we provide some general criteria for ensuring redundancy of 
arguments on the basis of the (redundancy of some) positions in the rhs's of program 
rules, specifically the positions of the rhs's where the arguments of the functions 
defined in the lhs's 'propagate' to. Theoremsn]a n d|ni sa y that the more restrictive a 
semantics is, the more redundancies there are for the arguments of function symbols. 
According to our hierarchy of semantics (by ^), eval seems to be the most fruitful 
semantics for analyzing redundant arguments. In the following, we focus on the 
problem of characterizing the redundant arguments w.r.t. eval. 



Removing Redundant Arguments Automatically 



13 



6. 1 Using Redundant Positions for Characterizing Redundancy: 

the Variable Case 

In this section, we focus on the problem of characterizing the redundant arguments 
w.r.t. eval-R. by studying the redundancy w.r.t. eval^ of some positions in the rhs's 
of program rules. The following definition is useful to detect whether the variables 
of the i-th argument in a lhs of symbol / propagate to positions in the rhs under 
the same i-th argument of symbol /. 

Definition 5 ((/, i) -redundant variable) 

Let / £ V, i € {l,...,ar(/)}, and t £ T(T,X). The variable x £ X is (f,i)- 
redundant in t if it occurs only at positions p £ Vos x (t) which (i) are redundant 
w.r.t. eval-K in t, i.e., p £ rpos eva \ (t), or (ii) they appear inside the i-th parameter 
of /-rooted subterms of t, i.e., 3q such that q.i < p and root(t\ q ) = f. 

Note that variables which do not occur in a term t are trivially (/, i) -redundant in 
t for any / £ T and i £ {1, . . . , ar(/)}. 

Example 5 

Consider the rules for symbol lastnew in Example ^ 

lastnew(x,nil,z) = z lastnew(x,y:ys,z) = lastnew(y ,ys ,z) 

Variable x is (lastnew, l)-redundant in rhs's n = z and r-i — lastnew(y,ys,z), 
since it does not appear in them. Variable ys is (lastnew, 2)-redundant in rhs f2, 
since it appears under the second argument of symbol lastnew. 

Now, we are able to provide the second effective method to determine redundant 
arguments based on the (/, i)-redundant variables occurring in rhs's. In order to 
prove Theorem 1101 below, we introduce some auxiliary definitions and lemmata. 

Given a TRS 71 = (J 7 , R), we write 71 f to denote the TRS 71 } = (J 7 , {I -> r £ 
R | root(l) = /}) which contains the set of rules defining / £ V. The following 
definition provides the set of positions of the z-th parameter of / symbols in t. 

Definition 6 

Let f £ J 7 , i £ {1, . . . ,ar(f)}, and t £ T{F,X). We define Pos fti (t) = {q.i £ 
Vos(t) | root(t\ q ) = /}. 

Let I = tx, . . . , t n be a sequence of terms, P = p\ , . . . , p n be a sequence of positions 
of another term s, and P' = p[, . . . ,p' m be a subsequence of P (i.e., m < n and 
3/i : {1, . . . , m} — * {1, . . . ,n} such that p[ = p^u) and i < i' => fi(i) < /i(i')), we 
denote t\p> = t[, . . . , t' m such that t\ — t^uy The following result is auxiliary and 
proves that the same constructor term is obtained by rewriting when we replace the 
set of subterms at eval^-redundant and Posf^ positions in a term by an arbitrary 
set of terms. 

Proposition 5 

Let Kbe a left-linear CS, f £ V, and i £ {1, . . . , ar(f)}. Let t £ T(T), P C 
PoSf t i(t) U rpos ev3 i n (t) be a set of disjoint positions, and s £ T(!F). Let t — ►* S for 
some 8 £ T(C). If, for alH — > r £ TZf, l\i is a variable which is (/, i)-redundant in 
r, then t[s]p S. 

Now, we provide the second effective method to detect redundancy. 
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Theorem 10 [Detecting redundancy: the Variable Case) 

Let K be a left-linear CS. Let / € T> and i € {1, . . . , ar(f)}. If, for all / -> r e %, 
Z|j is a variable which is (/, i)-redundant in r, then i € 7W<7 e val-» (/)• 

Example 6 

A standard example in the literature on useless variable elimination (UVE) — 
a popular technique for removing dead variables, see llWand and Siveroni 19991 
|Kobay ashi 2000| — is the following program 4 with constructor symbols C = {0,s} 
and variables a, bogus, and j: 

loop(a, bogus, 0) = loop(s (a) , s (bogus) , s (0) ) 
loop(a, bogus, s(j)) = a 

Here it is clear that the second argument does not contribute to the value of the 
computation. By Theorem 1101 the second argument of loop is redundant w.r.t. 
eval TC . 

The restriction to left-linear rules in Theorem ^| above is not strictly necessary; 
however, in most practical cases the redundancy of the argument of symbol / cannot 
be analyzed independently when we consider repeated variables in left-hand sides, 
as witnessed by the following example. 

Example 7 
Consider the TRS 11: 

f(x,x) = a 

where f and a are the only function symbols in the signature. Since every ground 
term t rewrites to a (this can be easily proved by structural induction) , both argu- 
ments of f are redundant w.r.t. eval-R.. However, if we add a new constant symbol 
b, then no argument of f is redundant anymore. 

The following example demonstrates that the restriction to constructor systems in 
Theorem 1101 is also necessary. 

Example 8 

Consider the following non-constructor TRS 1Z where C = {a, b}: 

f(a,x) = g(f(b,x)) g(f(b,x)) = x 

Then, the second argument of f (a,x) in the lhs of the first rule is a variable 
which, in the corresponding rhs of the rule, occurs within the second argument of 
a subterm rooted by /, namely f (b,x). Hence, by Theorem llOl we would have that 
2 £ rar^ eva | TC (f ). However, eval^(f (a, a)) = {a} ^ {b} = eval TC (f (a,b)), which 
contradicts 2 £ rarg evai ^ (f ). 

Moreover, the extension of this result to the normalization semantics nf is not 
possible, as shown in the following example. 

4 The original example uses natural 100 as stopping criteria for the third argument, while we 
simplify here to natural 1 in order to code it only with two rules. 
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Example 9 

Consider the TRS TZ where C = {a,b}: 
f(a,x) = a 

This TRS satisfies the conditions of Theorem 1101 and then 2 £ rarg eva \ (f). In 
concrete, we have that, for all s, eval-R,(f (b,s)) = 0. However, nf^(f (b,a)) = 
{f (b,a)} j4 {f (b,b)} = nf K (f (b,b)). 

Now, we are able to detect some redundancies in Example ^ 

Example 10 

Let us revisit the following rules from the CS TZ of Example Q 

lastnew(x,nil,z) = z lastnew(x,y:ys,z) = lastnew(y,ys,z) 

Using Theorem I1UI we are able to conclude that the first argument of function 
lastnew is (trivially) redundant w.r.t. eval^, since, in every lhs, the first parameter 
of lastnew is a variable that is (lastnew, l)-redundant in the respective rhs. 

Unfortunately, Theorem 1101 does not suffice to prove that the second argument 
of lastnew is redundant w.r.t. eval-^,, and this motivates the next section. 

6.2 Using Redundant Positions for Characterizing Redundancy: 

the Pattern Case 

In the following, we provide a different sufficient criterion for redundancy which 
is less demanding regarding the shape of the left hand sides, although it requires 
confluence and evalK-definedncss, in return. The following definitions arc helpful to 
determine the redundancy of argument i of / when / is defined by 'matching cases' 
for the argument i in the different rules. 

Definition 7 

Let J 7 be a signature, t = f(t\,...,tu), s = /(si,...,Sfe) be terms and I 6 
{l,...,fc}. We say that t and s unify up to i-th argument with mgu a if 
(ii, . . . ,ti_i, t i+ i, ...,t k ) and (sx, . . . , s<_i, s i+ x, . . . , s k ) unify mith mgu a. 

Definition 8 ((/,i) -triple) 

Let TZ = (J 7 , R) be a TRS, / GJF, and i g{l, . . . , ar(f)}. Given two different (possibly 
renamed) rules I — > r, I' — > r' in TZf such that Var(l) D Var(V) — 0, we say that 
(I — > I' — ► r', cr) is an (/, i)-triple of 72. if Z and Z' unify up to i-th argument with 
mgu a. 

Example 11 

Consider the TRS TZ from Example^ This program has a single (lastnew, 2)-triple: 
( lastnew (x, nil ,z)=z, lastnew (x' ,y : ys ,z ' )=lastnew(y ,ys ,z'), [xhx^zhz 1 ]) 

The following definition allows us to consider rules for symbol / which are "se- 
mantically equivalent" after replacing some variables and i-parameters in their rhs's. 
The basic idea is to check joinability of the (/, i)-triples of Definition |H1 where vari- 
ables below the i-th argument of symbol / in the left-hand sides of the rules of the 
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triple are explicitly instantiated by a dummy symbol a (Definition below) . Intu- 
itively, joinabilty of (all) such triples, then, amounts at proving the z-th argument 
of / as redundant fTheorem II II below) . 

In the following, we will use notation t either for a fc-tuple of terms t\ , . . . , th 
or for a sequence of a unique term t, . . . , t; the distinction will be clarified by the 
context. 

Definition 9 {Joinable ( / ', i) -triple) 

Let 1Z be a TRS, / € V, and i G {1, . . . , ar(f)}. Let a be an arbitrary constant. 
An (/, j)-triple (I — ► r,l' — > r',a) of TZ is joinable if <J C {Ti(r)) and crc(' r i'( r '')) are 
joinable (i.e., they have a common reduct). Here, substitution o~c is given by: 



o-c{x) 



a(x) if x g" Var(l\i) U Var(l'\ 
a otherwise 



n(t) 



and transformation r; is given by 
t ifl\ t EX 

J[a] Q if l\i g X and Q = {p 6 Pos fti (t) \ Var(t\ p ) n Var(/|,) ^ 0} 

Note that the constant a in the previous definition can be replaced by any ground 
term. In the case of many-sorted signatures, we would consider different constants 
'a', one for each sort. 

Example 12 

Consider again the CS 1Z in Example ^ and the single (lastnew, 2)-triple given in 
Example II II Let us call the rhs's 

r*i = z and T2 = lastnew (y , ys , z ' ) 

for the lh's l\ = lastnew(x,nil,z) and li = lastnew(x' ,y:ys,z'). Let us con- 
sider that is the constant for the sort of the first argument of lastnew and nil 
is the constant for the sort of the second argument of lastnew. The corresponding 
transformed rhs's are 

T h(ri) — z and 77 2 (r2) = lastnew (y, nil, z'). 

With a = [x i ► x',z z'] and oq = [x x' , z z ' , y 0, ys i— > nil], the 
corresponding instantiated rhs's are 

&c{t~Ii i r i)) — z ' an d <Jc{n 2 { r 2)) = lastnew(0,nil,z' ). 

We can prove (^(t/j (ri)) an d &c( T i 2 ( r 2)) are joinable, since the variable z' is the 
common reduct. Hence, the considered (lastnew, 2)-triple is joinable. 

Roughly speaking, the result below formalizes a method to determine redundancy 
w.r.t. eval-R, which is based on finding a common reduct of (some particular instances 
of) the right-hand sides of rules. 

Definition 10 ((/, i)- joinable TRS) 

Let 1Z be a TRS, / E T, and i G {1, . . . , ar(f)}. 1Z is (/, i) -joinable if, for all 
/ — > r £ TZf and x G Var(l\i), x is (/, z)-redundant in r and all (/, i)-triples of 1Z 
are joinable. 
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The following result is auxiliary for Theorem 1111 and proves that the same con- 
structor term is obtained by rewriting when we replace the set of subterms at 
eval-R-rcdundant and Posf^ positions in a term by an arbitrary set of terms. 

Proposition 6 

Let TZ be a left-linear, confluent, and eval^-defined CS. Let / G V and i 6 
{1, . . . , ar(f)}. Let t G ^"(-T 7 ), P Q Pos/,j(i) U rpos eml ^(t) be a set of disjoint 
positions, and a be a constant. Let t — ►* S for some S G T(C). If TZ is (/, i)-joinable, 
then t[a] P — >* S. 

Now, we provide the third effective method to detect redundancy. 
Theorem 11 (Detecting redundancy: the Pattern Case) 

Let TZ be a left-linear, confluent and eval^-defincd CS. Let / G T) and 
i € {1,.. . ,ar(f)}. HTZ is (/, z)-joinable, then i € rarg eval ^(f). 

Confluence and eval-R.-definedness are necessary, as shown in the following examples. 

Example 13 

Consider the following non-confluent CS TZ: 

f(0) = f(s(x)) = g(f(x)) g(x) = g(x) = s(0) 

By Theorem 1111 we would have 1 G rarg eval ^ (f ), since the (f,l)-triple 
(f (0)=0, f (s (x) )=g(f (x) ), id) is joinable, i.e., the common reduct of terms 
and g(f (0)) is 0. However, eval TC (f (0)) = {0} ^ {0, s(0)} = eval K (f (s (0) )). 

Example 14 

Consider the following non-eval^-defined CS TZ: 

f(0) = f(s(x)) = f(x) g(s(0)) = 

By Theorem ITTl we would have 1 G rarg sm ^(±). But eval K (f (0)) = {0} ^ = 
eval K (f (g(0))). 

Joinability is decidable for terminating, confluent TRSs as well as for other 
classes of TRSs such as right-ground TRSs dOyamaguchi 1990| ) and confluent semi- 
constructor TRSs ( Mitsuha shi et al. 2004j) (a semi-constructor TRS is such a TRS 
that every subterm of the rhs of each rewrite rule is ground if its root is a defined 
symbol). Hence, Theorem 1111 gives us an effective method to recognize redundancy 
in completely defined, confluent, and (semi-)complete TRSs, as illustrated in the 
following. 

Example 15 

Consider again the CS TZ of Example ^ This program is confluent, terminating 
and completely defined (considering sorts), hence is eval^-defined. By Example 1 101 
the first argument of lastnew is redundant w.r.t. eval^, using Theorem 1101 Now, 
the second argument of lastnew is redundant w.r.t. eval-R. using the new Theorem 
1111 As a consequence, the positions of variables x and xs in the rhs of the first 
rule of applast have been proven redundant. Then, since both lastnew (0, nil, z) 
and z rewrite to z, 7?. a ppiast is (applast, l)-joinable. And again by Theorem II II 
we conclude that the first argument of applast is also redundant. Hence, 1 G 

rar 5eval K ( a PP last ) and 1,2 6 ™»"5eval TC ( lastnew )- 
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Table 1. Summary of Results 



Semantics 


Theorem 


Requirements 


S 


Th. (Antimonotonicity) 




S 


Prop. Q (Non-redundancy) 




{red, hnf, nf , eval} 


Th. (Decidability) 


LL, RG 


{nf , eval} 


Coro. (Approximation 7L rg ) 


LL, ND, NDT (ED, EDT) 


S 


Th. |U] (Antimonotonicity - positions) 




S 


Prop. |3] (Non-redundancy - positions) 




eval 


Th. ESI (The Variable Case) 


CS, LL, VR 


eval 


Th.[TT](The Pattern Case) 


C, CS, ED, LL, JT 



C : 


Confluence 


LL 


CS : 


Constructor System 


ND 


ED : 


eva l-dcfinedness 


NDT 


EDT : 


eva 1 7j rs -determinacy 
Joinability of (/, i)-triples 


RG 


JT : 


VR 



Left-Linearity of the TRS 

nf-definedness 

nf iz rg -determinacy 

Right-ground TRS 

Variables in l\i are (/, i)-redundant in r 



Let us conclude with a few general remarks about the complexity of our approach, 
that is, the analysis time to detect redundant arguments (the cost of performing 
the optimizations proposed in Section \7\ is negligible) . In Table E] we provide a 
summary of the main results in the paper. Theorem I1UI only requires syntactic 
properties which can be tested in linear time on the size of the TRS (i.e., on the 
sum of sizes of each rule, where the size of a rule is the sum of sizes of the left- 
and right-hand sides). The conditions LL, C and ED in the premises of Theo- 
rem 1111 are standard properties of rewrite systems (as remarked in Section [3] a 
TRS 1Z is ED if 1Z is normalizing and completely defined, but there is no di- 
rect way to check whether a TRS is normalizing and then termination is required) 
and then assumed to be fulfilled by the TRS 7Z and checked apart. The com- 
plexity of such properties for decidable cases has been investigated elsewhere (see, 
e.g., ( |Godoy and Tiwari 2004| |Kapur et al. 1987| IVerma 2002|) ) and a number of 
tools are available for checking them in practice: For instance, termination tools 
such as AProVE (|Giesl et al. 2004jl and CzME ( |Contejean et al. 2003| ), conflu- 
ence checking tools such as CzME, and tools for ensuring completely-definedness 
such as Sec <|Hendrix et al. 2005jl . Thus, the only property which is strictly new 
in our framework is JT. As we mentioned above, joinability is decidable for sev- 
eral classes of TRSs JGodoy and Tiwari 2004) IVerma 20021 IMitsuhashi et al. 2004)1 . 
Actually, there are (cubic) polynomial time algorithms for joinability of ground 
systems l|Verma 20021 Theorem 12) and a slightly more general class of TRSs is 
considered in ( |Godoy and Tiwari 2004| ), namely right- (ground or variable) rewrite 
systems. In our implementation however, confluence and termination of the TRS 
are assumed for the application of Theorem (see above) and then joinability of 
terms t and s is decidable by just checking whether the normal forms of t and s are 
equal. 



Removing Redundant Arguments Automatically 



19 



7 Erasing Redundant Arguments 

The presence of redundant arguments within input expressions wastes memory 
space and can lead to time consuming explorations and transformations (by re- 
placement) of their structure. Then, since redundant arguments are not necessary 
to determine the result of a function call, it is worth to develop methods and tech- 
niques to avoid such unpleasant effects. 

As remarked in the introduction, inefficiencies caused by the redundancy of argu- 
ments cannot (in general) be avoided by using rewriting strategies. In this section 
we formalize a procedure for removing redundant arguments from a TRS. The basic 
idea is simple: if an argument of / is redundant, it does not contribute to obtaining 
the value of any call to / and can be dropped from program 1Z. Hence, we remove 
redundant formal parameters and corresponding actual parameters for each func- 
tion symbol and function call in 1Z. We begin with the notion of syntactic erasure 
which is intended to pick up redundant arguments of function symbols. 

Definition 11 {Syntactic erasure) 

A syntactic erasure is a mapping p : T — ► V(N) such that for all / € J-, p(f) C 
{1, . . . , etr(/)}. We say that a syntactic erasure p is sound for a semantics S if, for 
all / e T, p(f) C rarg s (f). 

Example 16 

Given the signature T = {0, nil, s, : , applast, lastnew} of the TRS 1Z in Example 
^ with ar(0) = ar(nil) = 0, ar(s) = 1, ar( : ) = ar(applast) = 2, and etr(lastnew) = 
3, and according to Example 1 151 the following mapping p is a sound syntactic era- 
sure for the semantics eval^: p(0) = p(nil) = p(s) = p(:) = 0, p(applast) = 
{1}, and p(lastnew) = {1,2}. 

Since we are interested in removing redundant arguments from function symbols, 
we transform the functions by reducing their arity according to the information 
provided by the redundancy analysis, thus building a new, erased signature. 

Definition 12 (Erasure of a signature) 

Given a signature T and a syntactic erasure p : T ' — » P(N), the erasure of T is the 
signature T p whose symbols f p E T p are one to one with symbols / 6 T and whose 
arities are related by ar(f p ) = ar(f) — \p(f)\- 

Example 17 

The erasure of the signature in Examplc ll6l is T p = {0, nil, s, : , applast, lastnew}, 
with ar(0) = ar(nil) = 0, ar(s) = ar(applast) = ar(lastnew) = 1, and ar( : ) = 2. 
Note that, by abuse, we use the same symbols for the functions of the erased 
signature. 

Now we extend the procedure to terms in the obvious way. 
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Definition 13 [Erasure of a term) 

Given a syntactic erasure p : T — » "P(N), the function r p : T(T ', X) — > T(T P , X) on 
terms is: t p (x) = s if i G ^ and T p (f(ti, . . . ,t„)) = /pCr^J, . . . ,t p (^J) where 
{1, ...,n}- p(f) = {ii, . . .,i k } and i m < i m+ i for 1 < m < k. 

The erasure procedure is extended to TRSs: we erase the lhs's and rhs's of each rule 
according to r p . In order to avoid extra variables in rhs's of rules (that arise from 
the elimination of redundant arguments of symbols in the corresponding lhs), we 
replace them by an arbitrary constant of T (which automatically belongs to T p ). 

Definition 11± (Erasure of a TRS) 

Let 1Z be a TRS, a a constant, and p be a syntactic erasure for T . The erasure 1Z P 
of 1Z is 1Z P = (Tp, {t p (1) — > <7i(r p (r)) | I-tr£ R}) where the substitution ai for a 
lhs I is given by o~i(x) = a for all x G Var(l) — Var(r p (l)) and o~i(y) = y whenever 
2/ S Var(T p (l)). 

Note that the constant a in the previous definition can be replaced by any ground 
term. In a many-sorted signature, we will have different constants 'a', each one of 
an appropriate sort. 

Example 18 

Let 1Z be the TRS of Example Q]and p be the sound syntactic erasure of Example 
[TBI The erasure 1Z P of 1Z consists of the erased signature of Example El together 
with the following rules: 

applast(z) = z lastnew(z) = z 

applast(z) = lastnew(z) lastnew(z) = lastnew(z) 

Below, we introduce a further improvement aimed at obtaining the final, "optimal" 
program. 

The mapping r p induces an equivalence = Tp on terms given by: t = Tp s iff r p (t) = 
T p (s). We have the following property of sound erasures of terms. 

Proposition 7 

If the syntactic erasure p : T — * V(N) is sound with respect to the semantics S, 
then for all t, s € T(!F) : t = Tp s implies that S(t) = S(s). 

The following theorem establishes the correctness of the erasure procedure for a 
rewriting semantics S. 

Theorem 12 (Correctness) 

Let 1Z be a left-linear TRS, S be a rewriting semantics for 7Z, p be a sound syntactic 
erasure for S, and t G T(T). If 8 € S(£), then r p (t) t p (<5). 

The following theorem establishes the completeness of the erasure procedure for 
a rewriting semantics S. 

Theorem 13 (Completeness) 

Let 1Z be a left-linear TRS, S be a rewriting semantics for 1Z such that S ^ red^, p 
be a sound syntactic erasure for S, and t, 5 G T(T P ). If t 5, then W, 8' G 
such that Tp (t') = t and r p (<5') = 8, S(8') C S(f). 

The following theorem establishes the correctness and completeness of the erasure 
procedure for the semantics eval^. 
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Theorem 14 (Correctness and Completeness) 

Let TZ be a left-linear TRS, p be a sound syntactic erasure for eval^, t £ T(T), and 
S e T(C). Then, Tp (t) ->* Up 5 iff 5 e eval TC (t). 

In the following, we are able to ascertain the conditions for the preservation of 
some computational properties of TRSs after erasure. 

Theorem 15 (Preservation of Confluence) 

Let TZ be a left-linear TRS. Let p be a sound syntactic erasure for eval-R,. If TZ is 
eval^-defined and confluent, then the erasure 1Z P of TZ is confluent. 

Theorem 16 (Preservation of Normalization) 

Let TZ be a left-linear and completely defined TRS, and p be a sound syntactic 
erasure for eval-^,. If TZ is normalizing, then the erasure TZ P of TZ is normalizing. 

In the theorem above, we cannot strengthen normalization to termination. A 
simple counterexample showing that termination may get lost is the following; 
note that the opposite is also possible, i.e., a non-terminating TRS can be made 
terminating after the erasure. 

Example 19 

Consider the left-linear, (confluent, completely defined, and) terminating TRS TZ 
h(a,y) = a h(c(x),y) = h(x,c(y)) 

The first argument of h is redundant w.r.t. eval^. However, after erasing the argu- 
ment, we get the TRS 

h(y) = a h(y) = h(c(y)) 

which is not terminating. 

In the example above, note that the resulting TRS is not orthogonal, whereas 
the original program is. Hence, this example also shows that orthogonality is not 
preserved under erasure. 

After the erasure, a post-processing transformation able to remove redundant 
rules (w.r.t. an appropriate notion of rule redundancy) might be useful to restore 
termination or orthogonality in some cases, as the example above. Although this 
point is outside the scope of this paper, in the following we provide a program 
transformation that can improve the optimization achieved by the erasure. 

Definition 15 (Reduced erasure of a TRS) 

Let TZ be a TRS and p be a syntactic erasure for T . The reduced erasure TZ' p of TZ 
is obtained from the erasure TZ P of TZ by a compression transformation defined as 
removing any trivial rule t — > t of TZ P and then normalizing the rhs's of the rules 
w.r.t. the non-trivial rules ofTZ p . 

Reduced erasures are well-defined whenever TZ P is confluent and normalizing 
since, for such systems, every term has a unique normal form. 
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Example 20 

Let TZp be the erasure of Example The reduced erasure consists of the rules 
{applast(z) = z, lastnew(z) = z}. 

Since right-normalization preserves confluence, termination and the equational 
theory (as well as confluence, normalization and the equational theory, in almost 
orthogonal and normalizing TRSs) IjGramlich 2001J1 . and the removal of trivial rules 
does not change the evaluation semantics of the TRS TZ either, we have the follow- 
ing. 

Corollary 2 

Let TZ be a left-linear TRS, p be a sound syntactic erasure for eval^, t € T(!F) 1 
and 5 £ 1~(C). If (the TRS which results from removing trivial rules from) 7Z P is 
confluent and terminating (alternatively, if it is almost orthogonal and normalizing) , 
then, T p (t) —*Tn & ^ an< ^ on ^y ^ ft ^ eva It^ ) , where TZ' p is the reduced erasure of 
TZ. 

Erasures and reduced erasures of a TRS preserve left-linearity. For a TRS TZ satis- 
fying the conditions in Corollary |2J by using ( Gramlic h 200 ljl . it is immediate that 
the reduced erasure 7Z' p is confluent and normalizing. Also, TZ' p is completely defined 
if TZ is. 

Hence, let us note that these results allow us to perform the 'optimal' optimization 
of program applast in Examplcnwhile guaranteeing that the intended (evaluation 
or normalization) semantics is preserved. 



8 Experiments 

The practicality of our ideas is witnessed by the implementation of a prototype 
system which delivers encouraging good results for the techniques deployed in Sec- 
tional (Theorems^Jand^J) and the erasure procedure of Section[7| The prototype 
has been implemented in PAKCS IjHanus et al. 2003a}) . the current distribution 5 of 
the multi-paradigm declarative language Curry (|Hanus et al. 2003b|l . and is publicly 
available at http://www.dsic.upv.es/users/elp/redargs 

We have used the prototype to perform some preliminary experiments which show 
that our methodology does detect and remove redundant arguments of some com- 
mon transformation benchmarks, such as bogus, lastappend, allzeros, 
doubleflip, etc.; see (jLeuschel 1998fl and references therein. Tables [5] and [3] sum- 
marize the experiments. Benchmarks code as well as the programs obtained by the 
erasure procedure are included in |Appendix B| 

Table |21 shows the execution runtimes of the original and transformed programs 
in PAKCS, as well as the arguments in the whole program which are signaled as 
redundant for each benchmark using the notation: #signaled/#total. Runtimes 
have been measured in an "AMD Athlon XP" class machine running Fedora Core 3.0 
and using version 1.6.0 of the PAKCS compiler under SICStus Prolog 3.8.6. Natural 



5 See http://www.informatik.uni-kiel.de/~pakcs 
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Table 2. Execution of the original and transformed programs in Curry 



Name 


Call in original/erased program 




Time (ms) 


Gain 




bogus 


loop (fact 8) (fact 9) (fact 
loop' (fact 8) (fact 8) 


8) 


150 
150 


0% 


1/1 


applast 


applast [(fact 8)]ioooo (fact 
applast' (fact 8) 


8) 


168 
153 


9% 


3/3 


plus_minus 


minus_pe (fact 8) (fact 8) 
minus_pe' (fact 8) 




zzU 
155 


30% 


1/1 


plusJeq 


leq_pe (fact 8) (fact 8) 
leq_pe ' 




79 
~0 


100% 


1/1 


double_even 


even_pe (fact 8) 
even_pe ' 




77 
~0 


100% 


1/1 


sum_allzeros 


sum_pe [(fact 8)]ioooo 
sum_pe ' 




23 
~0 


100% 


1/1 


Mutual recursion 1 


f (fact 8) (fact 8) 
f ' 




123 
~0 


100% 


1/1 


Mutual recursion 2 


f (fact 8) 
f ' 




132 
~0 


100% 


1/1 



numbers are given by numbers 0, 1, 2, etc in the tables, instead of the notation Z/S 
x used in the bechmarks code. For benchmarking purposes, goals make use of the 
auxiliary factorial function, defined in a usual way. The number of elements of a 
list (when used) is indicated by a subindex. Note that the analysis time for each 
example is negligible. 

Important optimizations are obtained for most examples. In the case of program 
bogus, no appreciable optimization is achieved by removing redundant arguments, 
since Curry is a lazy language and the redundant argument in bogus is a useless 
variable. In order to dissociate the possible dependency of the achieved optimization 
w.r.t. the lazy evaluation of the language, Table 01 shows the execution runtimes of 
the benchmarks in the Maude interpreter 6 (version 2.1.1), which uses an innermost 
rewriting strategy. 

Note that, in this case, significant optimizations are also measured for programs 
bogus and applast. The plus_minus example runs in nearly half the original exe- 
cution time in both, lazy and eager systems, which seems consistent with the fact 
that one of the two arguments have been removed. 
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Table 3. Execution of the original and transformed programs in Maude 



Name 


Call in original/erased pro; 


jram 




Time (ms) 


Gain 




bogus 


loop (fact 8) (fact 9) 
loop' (fact 8) (fact 8) 


(fact 


8) 


651 
47 


93% 


1/1 


applast 


applast [(fact 8)]ioooo 
applast' (fact 8) 


(fact 


8) 


102 

54 


47% 


3/3 


plus_miiius 


minus_pe (fact 8) (fact 
minus_pe' (fact 8) 


8) 




30 


51% 


1/1 


plusjeq 


leq_pe (fact 8) (fact 8) 
leq_pe ' 




33 
~0 


100% 


1/1 


double_even 


even_pe (fact 8) 
even_pe ' 






32 
~0 


100% 


1/1 


sum_allzeros 


sum_pe [(fact 8)]ioooo 
sum_pe ' 






40 
~0 


100% 


1/1 


Mutual recursion 1 


f (fact 8) (fact 8) 
f ' 






73 
~0 


100% 


1/1 


Mutual recursion 2 


f (fact 8) 
f ' 






61 
~0 


100% 


1/1 



9 Related Work 

Some notions have appeared in the literature of what it means for a term in a TRS 1Z 
to be "computationally irrelevant" . As we are going to see, our analysis is different 
from all the related methods in many respects and, in general, incomparable to 
them. 

Contrarily to our notion of redundancy, the meaninglessness of ( |Kuper 1994| 
|Kennaway et al. 1996| ) is a property of the terms themselves (they may have mean- 
ing in 1Z or may not), whereas our notion refers to arguments (positions) of function 
symbols. In ( |Kuper 1994[ Section 7.1), a term t is called meaningless if, for each 
context C[ ] s.t. C[t] has a normal form, we have that C[t'] has the same normal 
form for all terms t' . This can be seen as a kind of superfluity (w.r.t. normal forms) 
of a fixed expression in any context, whereas our notion of redundancy refers to the 
possibility of getting rid of some arguments of a given function symbol with regard 
to some observed semantics. The meaninglessness of (Kupcr 1994) is not helpful 
for the purposes of optimizing programs by removing useless arguments of function 
symbols which we pursue. On the other hand, terms with a normal form are proven 
meaningful (i.e., not meaningless) in (Kupcr 1994, |Kennaway et al. 1996| ), whereas 
we might have redundant actual parameters which are normal forms. 

Among the vast literature on analysis (and removal) of unnecessary data struc- 
tures, the analyses of unneededness (or absence) of functional programming QCousot and Cousot 19941 
Hughes 19880 , and the filtering of useless arguments and unnecessary variables of 
logic programming (Leusc hel and S0rensen 19961 IPettorossi and Proietti 1994J1 are 
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the closest to our work. In (Hughes 1988), a notion of needed / unneeded parameter 
for list-manipulation programs is introduced which is closely related to the redun- 
dancy of ours in that it is capable of identifying whether the value of a subterm 
is ignored. The method is formulated in terms of a fixed, finite set of projection 
functions which introduces some limitations on the class of neededness patterns 
that can be identified. Since our method gives the information that a parameter is 
definitely not necessary, our redundancy notion implies Hughes's unneededness, but 
not vice versa. For instance, constructor symbols cannot have redundant arguments 
in our framework (Proposition U, whereas Hughes' notion of unneededness can be 
applied to the elements of a list, as shown in the following example. 

Example 21 

Consider the following TRS defining the length function for lists. 

length(nil) = length (x : xs) = s (length(xs) ) 

Hughes' analysis is able to determine that, in the length function, the spine of the 
argument list is needed but the elements of the list are not needed; this is used to 
perform some optimizations for the compiler. However, this information cannot be 
used for the purposes of our work, that is, to remove these elements when the entire 
list cannot be eliminated. 

On the other hand, Hughes's notion of neededness /unneededness should not be 
confused with the standard notion of needed (positions of) redexes of ( |Huet and Levy 1991| ): 
Example[21shows that Huet and Levy's neededness does not imply the non-redundancy 
of the corresponding argument or position (nor vice versa). 

The notion of redundancy of an argument in a term rewriting system can be 
seen as a kind of comportment property as defined in l|Cousot and Cousot 1 994). 
Cousot's comportment analysis generalizes not only the unneededness analyses but 
also strictness, termination and other standard analyses of functional programming. 
In l|Cousot and Cousot 1994jl . comportment is mainly investigated within a deno- 
tational framework, whereas our approximation is independent from the semantic 
formalism. 

Proietti and Pettorossi's elimination procedure for the removal of unnecessary 
variables is a powerful unfold/fold-based transformation procedure for logic pro- 
grams; therefore, it does not compare directly with our method, which would be 
seen as a post-processing phase for program transformers optimization. Regarding 
the kind of unnecessary variables that the elimination procedure can remove, only 
variables that occur more than once in the body of the program rule and which do 
not occur in the head of the rule can be dropped. This is not to say that the transfor- 
mation is powerless; on the contrary, the effect can be very striking as these kinds of 
variables often determine multiple traversals of intermediate data structures which 
are then removed from the program. Our procedure for removing redundant ar- 
guments is also related to the Leuschel and S0rensen RAF and FAR algorithms 
I ILeuschel and S0rensen 19961 ), which apply to removing unnecessary arguments in 
the context of (conjunctive) partial evaluation of logic programs. However, a com- 
parison is not easy either as we have not yet considered the semantics of computed 
answers for our programs in detail. 
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People in the functional programming community have also studied the problem 
of useless variable elimination (UVE). Apparently, they were unaware of the works 
of the logic programming community, and they started studying the topic from 
scratch, mainly following a flow-based approach IjWand and Siveroni 1999| or a 
type-based approach IjBerardi et al. 2 000 ; Kobayashi 2000 ) ; sec ( Bcr ardi et al. 200"0jl 
for a discussion of this line of research. All these works address the problem of safe 
elimination of dead variables but heavily handle data structures. A notable excep- 
tion is (|Lin and Stoller 2002|l . where Liu and Stoller discuss how to safely eliminate 
dead code in the presence of recursive data structures by applying a methodology 
based on regular tree grammars. Unfortunately, the method in l|Liu an d Stollc r~2002jl 
does not apply to achieve the optimization pursued in our running example applast. 

Obviously, there exist examples (inspired) in the previously discussed works which 
cannot be directly handled with our results. 

Example 22 

Consider the TRS of Example 1211 together with the following function symbol f : 

f(x) = length(x:nil) 

Our methods do not capture the redundancy of the argument of f . In (|Lin and Stoller 2 002 ) 
it is shown that, in order to evaluate length(xs), we do not need to evaluate 
the elements of the argument list xs; as Hughes's unneededness. In Liu et al.'s 
methodology, this means that we could replace the rule for f above by the rule 
f (_) = length(_:nil) where _ is a new (dummy) constant. Nevertheless, the new 
TRS can be used now to recognize the first argument of f as redundant. That is, we 
are allowed to use the following rule f = length (_: nil) which completely avoids 
wasteful computations on redundant arguments. Hence, the different methods are 
complementary and an enhanced test might be developed by properly combine 
them. 



10 Functional Logic Programming: Narrowing 

Programs written in muti-paradigm functional-logic languages such as Curry (see 
e.g. those in |Appendix B| ) are usually not different from (equivalent) programs writ- 
ten in the (pure) functional language Haskell. The difference only shows up during 
the evaluation. In Curry, one can evaluate expressions containing logical variables 
(that are evaluated non-deterministically to deliver computed answers as in Pro- 
log) while in Haskell only completely ground expressions can be (deterministically) 
evaluated to compute its value. In fact, Term Rewriting Systems are also used as 
abstract models of programs written in such languages, although narrowing, rather 
than rewriting, is usually the underlying computational mechanism l|Hanus 1 994). 

Before the conclusions, let us discuss how the notions and techniques presented so 
far could be adapted to cope with more sophisticated, multi-paradigm functional- 
logic languages. The most popular operational principle to deal with logical vari- 
ables within function calls is known as narrowing, as used in functional logic 
programming (see (Hanus 1994) for a survey). Narrowing is an unification-based, 
parameter-passing mechanism which extends functional evaluation through goal 
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solving capabilities as in logic programming. A narrowing step instantiates variables 
of an expression and then applies a reduction step to a redex of the instantiated ex- 
pression. The instantiation of variables is usually computed by unifying a subterm 
of the entire expression with the left-hand side of some program equation. Narrow- 
ing provides completeness in the sense of logic programming, i.e., computation of 
answers, as well as functional programming, i.e., computation of normal forms. For- 
mally, a term s narrows to t in 1Z, denoted by s i, iff there exists a non-variable 
position p of s, a (standardized apart) rule I — > r G 1Z, and a substitution a such 
that s\ p and I unify with mgu a and t = <j(s[r] p ). 

Narrowing can be considered as a mapping (or semantics) S : T(J-, X) — » 
V(Subst(J : ,X) x T(T, X)) that associates a set of pairs (substitution,term) to an 
input term IjHanus and Lucas 2001(1 . The following is a typical evaluation semantics 
based on narrowing 

eval narr (t) = {(a, s) \ t s A s £ T(C, X)} 

The substitutions computed by narrowing are usually restricted to the variables 
of the input term. Within this semantic framework, the idea of redundancy for 
term rewriting as proposed in Definition ^ cannot be naively lifted to redundancy 
for narrowing (considering arbitrary input terms), as revealed by the following 
example. 

Example 23 

Consider the TRS of Example ^ The first argument of symbol lastnew is re- 
dundant w.r.t. eval n arr, i.e., for all contexts C[] and for all t, s S T(J-,X) such 
that root(t) — f, eval narr (C[i]) = eval narr (C[£[s]i]). For instance, with the input 
term t — lastnew(x,0:nil,s(0)), we have eval narr (i[s]i) = {(id, s(0))} for all 
s G T(T ", X). This is because, in every lhs, the first argument is a variable that is 
never inspected in the corresponding rhs. However, the second argument of symbol 
lastnew is not redundant w.r.t. eval narr . Consider the goal t' = lastnew (x,y , s (0) ), 
then eval narr (i') = {([y i-> nil], s(0)), ([y ^ w:nil], s(0)), . . .} ^ {(id, s(0)>} = 
eval narr (i'[nil]2). The reason is that there exist many narrowing derivations for t'\ 

lastnew(x,y,s(0)) ^{y^nil} s(0), lastnew(x,y , s (0) ) ~^{yH^ w:n ii} s(0), ... 

but only this one for i'fnil^: lastnew(x,nil,s(0)) ~*id s(0). 

Thus, the general problem of analyzing redundancy w.r.t. the observable of com- 
puted answers is a challenging line of research that we pursue as future work (hence 
outside the scope of this paper). Nevertheless, we can still outline different possi- 
bilities for analyzing redundancy of arguments w.r.t. narrowing in some particular 
cases by applying the results in this paper. 

Restriction to the variable case We have seen in Example l23l that the naive notion of 
redundancy for narrowing is still fruitful when we consider the case of an argument 
in lhs's that always corresponds to a variable that is never inspected during the 
computation, i.e. the Variable Case in Section 16.11 For instance, using Theorem 
1101 we can identify that the first argument of symbol lastnew in Example Q is 
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redundant for narrowing and that also the second argument of symbol loop in 
Example [5] is redundant for narrowing. 

Input terms with mode information Since the narrowing space is bigger than the 
rewriting space, the functional logic community (as well as the program transfor- 
mation and partial evaluation community) usually restrict their interest to preserve 
the narrowing semantics eval narr for a fixed set of goals, similarly to the argument 
filtering technique of dLeuschel and S0rensen 1 996) for logic programming. 

Example 24 

Consider again the TRS of Example Let us assume that we are only interested 
in the evaluation semantics of input terms that fit the shape lastnew (G ,G ,NG), 
where G denotes a ground term and NG an arbitrary term. This is known as mode 
information in logic programming and implies that the first and second arguments 
of symbol lastnew are understood only as input data whereas the third argument 
is understood as input and output data. Then the techniques presented in Section 
El can be applied to the arguments that are labeled with G. For instance, the first 
and second arguments of lastnew will be detected as redundant for the mode 
lastnew (G,G,NG). 

As mentioned before, more research is needed in order to come upon a generally 
correct notion of redundancy w.r.t. narrowing, which leads to effective detection 
algorithms that pay off in practice. We believe that our results in this paper can 
be valuable for these studies. 

11 Conclusion 

This work provides the first results concerning the detection and removal of useless 
arguments in program functions. We developed our results in a stepwise manner. 
We have given a semantic definition of redundancy which takes the semantics S 
as a parameter. We have considered different (reduction) semantics, including the 
standard normalization semantics (typical of pure rewriting) and the evaluation 
semantics (closer to functional programming). We have provided some decidability 
results about redundancy of an argument and a first effective method for detecting 
redundancies, which is based on approximation techniques. We have also provided 
two more practical methods to recognize redundancy which allows us to simplify 
the general redundancy problem to the analysis of the rhs's of the program rules. 
All the three methods to detect redundancies are different and useful. Moreover, 
we think that all results in this paper are of independent interest and can be used 
for other applications in the fields of rule-based and multi-paradigm declarative 
programming. 

Actually, inefficiencies caused by the redundancy of arguments cannot be avoided 
by using standard reduction strategies. Therefore, we have developed a transfor- 
mation for eliminating dead code which appears in the form of useless function 
calls and we have proven that the transformation preserves the semantics (and 
some operational properties) of the original program under ascertained conditions. 
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The optimized program that we produce cannot be created as the result of ap- 
plying standard transformations of functional programming to the original pro- 
gram, such as partial evaluation, supercompilation, and deforestation, see e.g., 
UPettorossi and Proietti 19 96a) . 

Furthermore, a prototype implementation of the (more practical) methods to 
detect redundancy together with the erasure procedure has been provided. The 
preliminary experiments performed with the prototype indicate that our approach 
is both practical and useful. We believe that the semantic grounds for redundancy 
analyses and elimination laid in this work may foster further insights and develop- 
ments in the program optimization community and neighbouring fields. 

Finally, apart from these comments, the problem of identifying redundant argu- 
ments of function symbols has been reduced to proving the validity of a particular 
class of inductive theorems in the equational theory of confluent, eval^-defined 
TRSs. We refer to ( |Alpuente et al. 2002a| ) for details, where a comparison with 
approximation methods based on abstract interpretation can also be found. 
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Appendix A Proofs 
Proofs of Section ^] 

Theorem Q 

Let S, S' be term semantics for a signature T . If S < S', then, for all / G J 7 , 
rarg s ,(f) C rarg s (f). 

Proof 

By contradiction. Given / G J- and i G rarg 5 r(f), by Definition ^ we have that, for 
all contexts C[ ] and for all t, s G T{T) such that root(t) = f, S'(C[t]) = S'(C[t[s]i\). 
Now, since S ^ S', there exists T C T(JF) such that S(C[i]) = S'(C[i]) n T = 
S'(C[t[s]i]) flT = S(C[i[s]i]). Hence, i G rarg s (f). □ 

Let 72. be a TRS such that |T(C)| > 1, and consider a rewriting semantics S such 
that eval-R, ^ S. Then, for all c € C, rarg s (c) = 0. 

Proof 

We prove by contradiction that rarg eval ^ (c) = 0, and then the conclusion follows 
by Theorem H Let t € T*(C) be such that root{t) = c. If i G rar 9eva\ n ( c ) > then 
eval7j(i[s]i) = eval7j.(i[s']j) for any s, s' G s.t. s 7^ s', thus contradicting 

Proofs of Section [5| 

Lemma 1 

For terms t, t', and position p G Vos{t) nVos(t'), the predicate equiv(t, t',p) = 3s G 
T(JF).t' = t[s] p is WSft;S definable. 

Proof 

Assume that the term t is represented by T, and the term t' is represented by T 1 . 
Then: 

equiv{f,f',p) = f V ? .(^(p < 9 ) A/ e ^(<7 &T f ^qG T' f )) □ 
Proposition 8 

Let S be a term semantics for a signature J 7 , f E J-, and i G {1, . . . , ar(/)}. The 
z-th argument of / is redundant w.r.t. S if for all term t G T{!F), for all p G Posit) 
such that root(t| p ) = /, and for all t' G such that equiv(t,t' ,p.i) is true, 

S(t) = S(f). 

Proo/ 

Immediate. □ 

Let S be a term semantics for a signature JF. If S is WS/cS definable, then redundancy 
w.r.t. S is decidable. 
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Proof 

By using Proposition [S] Assuming that S is denned in WSfcS by the formula 
Q(X,Y), redundancy is WSfcS definable by using the following formula: 

VT VS VW Vp G T.(Term(f) A Term(S) A Term(W) A 
p E T f A equiv(T, S,p.i) ($(f , #) <^> $(5, #))) 

Now, by Lemma Hand Theorem redundancy is decidable. □ 
Proposition 

Let S,S' be term semantics for a signature T. If S ^ S', S' is WSfcS definable, and 
there exists a window set T C T(.F) of S' w.r.t. S which is WSfcS definable, then S 
is WSfcS definable. 

Proof 

Assuming that semantics S' is defined by the WSfcS formula $'(A, Y), and that set 
T is defined by the WSfcS formula fl(X), we build the following formula defining 

S: $(X,Y) d = <f>'(X,Y) Afi(y) □ 
P/ieorem^] 

The set HNF K of a finite left-linear, right-ground TRS TZ is WSfcS definable. 
Proo/ 

Since the set REDEX^ of all redexes of a TRS TZ is WSfcS definable IjGallier and Book 1985jl . 
and the set H£>)[ L ] = i l e r ( :F ) I 3s e L± — > tz «} is WSfcS definable 7 for any 
regular set of terms L. We can formulate the set HNF-r as: <&(X) = -if2(X), where 
the set (— ^[REDEX-k] is defined by the predicate tt(X). □ 

Theorem\^ 

For a left-linear, right-ground TRS TZ over a finite signature J 7 , the redundancy 
w.r.t. semantics red-R,, hnfn, nf-R., and eval-^ is decidable. 

Proof 

Since the semantics red^ is WSfcS definable, and the sets NF-r and T(C) are WSfcS 
definable, by Proposition [21 we obtain that the semantics nf^ and eval^ are WSfcS 
definable. Then, by considering also Theorem0]we obtain that the semantics hnf^j 
is WSfcS definable. Finally, by Theorem |3 redundancy is decidable for semantics 
redn, hnf TC , nf TC , and eval K . □ 

Proofs of Section \5. 1\ 

Theorem\Q 

Let TZ be a TRS, TZ' be an approximation of 1Z, f G J 7 , i G {1, . . . ,ar(f)}, and 
S G {eval, nf}. If TZ is S-^-defined and S-r< is determined w.r.t. / and i, then i G 
™rg s ^{f). 

7 Actually, t he set (— is recognizab le for every recognizable tree language L IComon 20001. 
Hence, by {Thatcher and Wright 1968> it is WSfcS definable. 
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Proof 

We prove the result for eval^; the proof for nf is analogous. Note that, since 1Z is 
eval^-defined, |eval7j.(C[f])| > 1 and |eval-ft(C[i[s]i])| > 1. Moreover, since eval^/ is 
determined w.r.t. / and i, \eva\-jz(C[t])\ = 1 and |eval7j(C[i[s]j])| = 1. Otherwise, 
since (7[t[fi]j] -vtjy C[t], C[t[il]i] —>\, C[t[s]i], — ^C— and the constructor 
symbols of 1Z and TV are identical (since NF^ = NF-ji> and Jl is a defined symbol), 
we would also have eval^j./ (C[t [Sl].j]) | > 1. 

Assume that i g" rarg eva]K (/). Then, there exist C[ ], t G T(!F) such that root(t) = 
/, ands G T{F) such that eva\ K (C[t}) ^ eval TC (C[t[s]. i ]). Then, since |eval K (C[t])| = 
1 and |eval-ft(C[£[s]i])| = 1, it follows that S G eva\ n (C[t}) and 8' G evalrc(C[t[s]i]) 
verify 8 ^ 8'. By reasoning as above, this would mean that eval^j./ (C[t [rz].j]) | > 1 
thus leading to a contradiction. □ 

Theorem^ 

Let S be a term semantics for a signature T U {SI}. If S is WSfcS definable, then it 
is decidable whether S is determined w.r.t. / and i. 

Proof 

Assuming that S is defined in WSfcS by the formula $(A, Y), the property is WSfcS 
definable by using the following formula: 

VT Vp Term(f) ApeT/A p.i G Tq => (Vg G To, q = p.i) A 
ySTerm(S) A $(T, S) A VW {Term{W) A $(f, W> = S) 

Now, by Theorem |3 the conclusion follows. □ 
Theorem^ 

Let K be a left-linear TRS, 7?. r9 be the approximation rg of 1Z, f G J 7 , i G 
{1, . . . , ar(/)}, and S G {eval-R, rg , nf^}. It is decidable whether S is determined 
w.r.t. / and i. 

Proof 

Since lZ rg is a left-linear and right ground TRS, the semantics red-ji rg is WSfcS 
definable. The sets NF-r. and T(C) are WSfcS definable and, by Proposition |21 we 
obtain that the semantics nf^ and e\ia\-ji rg are WSfcS definable. Then, by Theo- 
rem we obtain that the property VC[ ], t G T(J-), root(t) = f,i G {1, . . . , or(/)}, 
|S(C[i[f2]i])| < 1 is decidable for S G {eval TCrg , nf TCrg }. □ 

Proofs of Section Q?| 

Theorem 

Let S, S' be term semantics for a signature .F. If S ^ S', then, for all t G T(F, A"), 
rpos s ,(t) C rpos s (t). 
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Proof 



By contradiction. Given t e T(J-, X) and p G rpos S /(i), by Definition 0] we have 
that, for all t', s G T{J-) such that £ and t' are p-prefix-equal, S(i') = S(t'[s] p ). Now, 
since S ^ S', there exists T C T(J r ) such that S(t') = S'(t') n T = S'(t'{s] p ]) n T = 
S(t'[s] p ]). Hence, p G rpos s (t). □ 

Proposition [3] 

Let 7?. be a TRS such that |T(C)| > 1, and S be a rewriting semantics such that 
eval TC ^ S. Then, for all t € T(C, X), rpos s (t) = 0. 



We prove by contradiction that rpos eva \ (t) — 0, and then the conclusion follows 
by Theorem El Let t G T(C,X). If p € rpos^ K (t), then for all t' G T(J r ) and 
s, s' G T(C) s.t. £ and t' are p-prefix-equal and s ^ s', eva\n(t' [s] p ) — eva\n(t'[s'] p ). 
In concrete, for t' G T(C) s.t. i' = er(t) for some a G Subst(C), we should have 
eval7e(f'[s] p ) = {i'[s] p } = {i'[s']p} = eval7j(f'[s'] p ), thus contradicting s^s'. □ 

Proposition^] 

Let S be a term semantics for a signature T,tG T(T, X), p G Vos(t), f G £>. For all 
positions <7,p' and i G rarg 5 (f) such that p = g.i.p' and 7"oof(f| g ) = /, p G rpos 5 (t) 
holds. 

Proo/ 

Let t' = C[/(i)] g such that i and f' are equal down to p. Since i G rarg s (f), for all 
term s G T(.T), S(C[/(i)] g ) = S(C[/(i)[s],;],). In particular, if s' = t'[s] qA \ p , then 
S(i') = S(t'[s']p) and the conclusion follows. □ 



We recall here the notion of descendants of a position in a rewrite sequence. This 
notion is usually meaningful for orthogonal TRS's (since descendants of redexes, 
called residuals, are also redexes) but it makes sense for arbitrary TRS's; we must 
always provide the concrete rule applied at each rewriting step, see (|TeReSe 20031 
Section 4.2). 

Definition 16 \Huet and Levy 

Let A : t — >;^ r s and q G Vos(t). The set q\A (alternatively, q\(p,l—>r)) of 
descendants of q in s is defined as follows: 



If Q C Vos{t) then Q\A denotes the set U 9 eQ <l\A- The notion of descendant 
extends to rewrite sequences in the obvious way. If Q is a set of pairwise disjoint 
positions in t and A : t — ►* s, then the positions in Q\A are pairwise disjoints. 



Proof 



Proofs of Section \6.1\ 



q\A 




if q < p or q || p, 
l\ Pl } if q=p.p\.pi withpi eVos x (l), 
otherwise. 
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Proposition^ 

Let Kbea left-linear CS, / € T>, and i G {1, . . . , ar(f)}. Let t G T(T), P C 
PoSf } i(t) U rpos eva! (i) be a set of disjoint positions, and s € T{!F). Let i — ►* (5 for 
some (5 £ ?~(C). If, for alii — > r G 72-/, 2|{ is a variable which is (/, i)-redundant in 
r, then t[s] P — >•* 5. 

Proof 

We prove by induction on n, t — > n 5. We assume |T(C)| > 1, which is necessary for 
Proposition [3] used below; otherwise the proof is trivial. 

1. If n = 0, then t = S, Posf t i(t) = 0, and by Proposition rpos ev3 ^(t) = 0. 
Hence, P = and t\s]p = t = 5. 

2. If n > 0, then t -%^ r t' S. Let P = P n Pos f , t {t) and P rJ)os = P \ P. 
Now, we prove that f[s|p i ]p i — ►* S. 

• (rooi(Z) ^ /) Let P q = {p e P l \ q < p} and P/ = P \ P ? . Then, 
P(\(q,l—*r) = P!. By induction hypothesis, i[s|p;]p; r t'\?\p!]p' — ►* 
S (if |j> G P/.p < q), or i[s| P/ ] P < = t'[s\ P .] P ' -+* * (if 3p G P/.p < g). Now, 
we prove t[s| P .'upJp/up g 

We have P q \{q,l^r) C Pos/^i'), since 7?. is a CS and this implies each 
position in P 9 is under a variable of I, i.e., Vg.p € P g , Bp' G Vosx(l)-y' < P- 
Then, £[s| P ' UP J P < up? — > r f° r some i" G T{T), since 7?. is left-linear. 
Now P/ U (P q \{q,l~>r)) C Pos fli (t') and, by induction hyphotesis, t" = 
*'[" y ]p/u(P,\(g,i^r>) ->* * for some uJ G T(J"). 
Hence, t[s|p 4 ]p 4 — ►* (5. 

• (root(l) = f) Let P 9 . s ; = {p G P g.i < p} and P/ = Pj \ P g .j. As in the 
previous case, we have i[s|p']p' — >* <5. Now, we prove t[s\p' U p Jp'up ; 

S. 

We have that t[s|p/up g Jp.'up, < —^i^r t" for some i" G T(J-), since Z|j 
is a variable, say x, and 72. is left-linear. Since x is (/, z)-redundant in 
r, q.Vos x (r) C Posj^{t') U ?"pos eva | (i'). We also have that for all p G 
Pq.i\{q, l—^r), there is p' G q.Vos x {r) such that p' < p. Hence, by induction 
hypothesis, t" = t'\w]pt u p ,\( 5 i-t r ) ^* ^ f° r some w G T(T). 
Then, t^lp-up^Jp/up^ -►* <5, i.e., i[s|pjp 6. 

Finally, (i[s| Pi ] Pi )[s|p rpos ]p rTOS 6 by definition. Hence, t[s] P — >■* 5. □ 
Theorem UfH 

Let K be a left-linear CS. Let / G 7? and i G {1, . . . , ar(f)}. If, for all I -v r G 72/, 
Z|i is a variable which is (/, i) -redundant in r, then i G rarg eval _(/). 

Proo/ 

Let C[ ] be a context such that C| p = □, and t,sE T(T) be terms such that 
root(t) = f. By Proposition^ M5 G T(C) s.t. C[t] S, C[t[s]i] S and viceversa. 
Hence, eva\n(C[i\) = eva\-ji(C{t{s)i}). □ 
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Proofs of Section \6.2\ 

Proposition^ 

Let TZ be a left-linear, confluent, and eval^-defined CS. Let / G V and i G 
{1, . . . , ar(f)}. Let t G T(F), P ^ Posf t i(t) U Tos eva | TC (i) be a set of disjoint 
positions, and a be a constant. Let t — ►* (5 for some (5 G T(C). If 7?. is (/, i)-joinable, 
then t[a] P — >* S. 

Proof 

We prove by induction on n, t — >™ (5. We assume |P(C)| > 1, which is necessary for 
Proposition |2| used below; otherwise the proof is trivial. 

1. If n = 0, then t — S, Posf^{t) — 0, and by Proposition 7T> 0S evai TC W = ®- 
Hence, P = and t[a] P = t = 8. 

2. If n > 0, then i £' -v"- 1 5. Let P = P n Pos f ^(t) and P rpo;5 = P \ p. 
Now, we prove that ifajp; — ►* <5. 

• (rooi(Z) ^ /) Let P q = {p G Pi \ q < p} and P( = P \ P q . Then, 
P(\{q, I— >r) = P/. By induction hypothesis, i[a] P ' -^i^ r t'\a) P i — ►* S (if 

G P/.p < o), or i[a]p' = i'[a]p? — ►* <5 (if 3p G P[.p < q). Now, we prove 
t[a\p[up q <5. 

We have P q \(q, I— *r) C Posf,i(t'), since 7?. is a CS and this implies each 
position in P q is under a variable of I, i.e., Vg.p G P q , 3p' G Vosx(l)-p' < p. 
Then, t[a]p' U p i — >;_> r t" for some t" G T{F), since 7?. is left-linear. 
Now P[ U (P q \(q,l^r)) C Pos f .i(t') and by induction hyphotesis, t" = 

*'[a]p/U(P,\(g,/^r)) -►* 5. 

Hence, i[a]p i — »* 5. 

• (root(l) = /) Since P is a disjoint set, we have P/ — Pi \ {q-i}- As in the 
previous case, we have t[a]p> — ►* S. Now, we prove i[a] g .i — ►* 5. 

— (l\i G X) Let i|j = x. Then, t[a] q ,i — l -^i^ r t'\a] q ,p OSx r r ). Since 1Z is 
(/, i)-joinable, q.Vos x (r) C Pos /^(t^Lirpos^ (f). Thus, by induction 
hypothesis, £'[a] g .p OS:c ( r ) ->* (5. 

— (Z|i A 7 ) By eval^-definedness, there exist I' —>■ r' G TZf and cr' G 
Subst(F) such that t[a] 9 .i| g ^* cr'(Z'). Assume V — > r' and Z — > r are 
different rules; otherwise Z | ^ — t\i = a and it is trivial. Then, I and I' 
unify up to the i-th argument with mgu 9. Thus, {I — > r, /' — > r', 0) is a 
joinable (/, z)-triple of 1Z. 

Moreover, we have that there exist substitutions a, (p, ip' G Subst(J-) 
that split a and a' in terms of 9, i.e., such that er(Z) — >* cr(l), a — ipo9, 
and tr' = ip' o 9. By joinability of (/, i)-triples, there exists w G T(J~, X) 
such that 9 c {n(r)) — »* w and 6c(n'(r')) —y* w. By stability of —**, 
95 o 9c(ri(r)) — »•* iy2(w;) and <// o #c( r Z'( r ')) i P'( w )- By definition of 
6» c and left-linearity, ^(z) = ^'(a;) for a; Var(6>(Z)|i) U Var(6»(Z')|i, 
and thus = f'i 110 )- Summarizing, we have t[ip o 9(r)] q —>* 5, 

t[ipo9 c {Ti(r))] q ^* t[ip{w)] q , t[p' oe c {T V {r'))] q t[ip{w)] q , andi^'o 
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8{r')] q -^>* 8' for some 6' G T(C). Now, we have to prove that t[(f(w)] q — > : 
S and 5' = S. 

Consider the set P C Vos{t') of positions where t[ip o 6(r)] q and t[tp o 
9c( r )]q differ. By definition of 77, P C Posj^{t') U rpos eva | TC (£'). By in- 
duction hypothesis, £[^> o #eCn( r ))]g ^* ^- Thus, by confluence, i[y> o 
»c(7l(r))], ->* ^* 5. And also t[<f/o9c{n> (r'))] q -»* t[p(tw)] g -> 

Finally, we prove that 5' = 5. 

- If £ then = a, Var(7'|i) = 0, and ip' o e c {T V {r')) = 
ip' o9(r'). Thus, J' = (5. 

— If G A", then = a; and, by definition of tj<, o 6c{Ti'(r')) = 
f' ° 0c(r'). Since 0c 0&) = a , we have o 6c(r') = tp' o 9(r'). Thus 
5' = 5. 

Then, t[s\ P > u{qA} } P > u{q . l} ->* 5, i.e., i[s| Pl ] Pl -►* 5. 
Finally, (i[s|pJpJ[s|p., pos ]p rpos ^* * by definition. Hence, f[s] P ^* (5. □ 



Theorem I 

Let 1Z be a left-linear, confluent and eval^-dcfincd CS. Let / G T> and 
i G {1, .. . ,ar(f)}. If TZ is (/, i)-joinable, then i G rarg eval ^(f). 



Proof 

Let a G C be a constant. Let C[ ] be a context such that C\ p = □, and t, s G 
T{F) be terms such that root(t) — f. By Proposition |SJ we have that V<5 G T(C) 
s.t. C[t] ->* J, C[i[c] 4 ] ->* 5. By confluence, eval TC (C[t]) = &/a\ n (C[t[a]i]) and 
eval w (C[i[a]i]) = eval w (C[t[o]i]). Hence, eval K (C[i]) = eval TC (C[i[s]i]). □ 



Proofs of Section \T\ 

Proposition 9 

Let p be a syntactic erasure for a signature J-, t £ T(J-, X) and cr G Subst(J r , X). 
Let cr p G Subst(3- p , X) be such that a p (x) = t p (<j(x)) for all x € X. Then, r p (a(t)) — 
o-p(T p (t)). 

Proof 

By structural induction. If t = x G A", then the result is immediate, since r p (x) = 
x. For the induction step, we take t = f(ti,...,t n ) for t\,...,t n G T(T, X). 
Then, 7>(ff(t)) = Tpiaifih,...,^))) = T p (f(a(h), . . . , a(t n ))) = 
/p( T >(o'(*ii)) ) ---! 7 >( cr (*ife))) I where {1, . . . , n} - p(/) = and i m < 

i m +i for 1 < m < k. By induction hypothesis, f p (T p (a(t ll )), . . . ,T p (a(t ik ))) = 
/pK( t p(^i)),---^p( t p(>*J))- And finally, f p (<7 p (T p (t n )),..., CTpfofc J)) = 
o-pifpirpiU, ),..., Tp(iiJ)) =tr p (r p (i)). □ 
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Lemma 2 

Let S be a term semantics for a signature T '. Let / G J-, and / C rarg$(f). Then, 
for all contexts C[ ] and for all t, si, . . . , sj. G T{!F) such that root(t) = /, S(C[i\) = 
S(C[t[3fc]/]). 

Proof 

By induction on fc = |J|. If A; = 0, it is immediate. If k > 0, let i G / and = 
si, . . . , Sj_i, Sj+i, . . . , Sfc. By the induction hypothesis, S(C[f]) = S(C[t[sJ c _ 1 ]/_{ i }]). 
Since i is redundant w.r.t. S, S(C[i[sJ £ _ 1 ]/_{ i }]) = S(C[t[sfc]/]), i.e., S(C[i]) = 
S(C[t[lk]j]). □ 

Lemma 3 

Let S be a term semantics for a signature J 7 . Let /, g G J 7 , and i G rar<7 S (/), 
j G rarg s (g). Then, for all contexts C[ ] and for all t,t',s,s' G T(^ r ) such that 
root(t) = / and root{t') = 3 , S(C*[t,t']) = S(C[t[s]i, 

Proof 

Let C"[] be the context C'[] = C[D,t']. By redundancy of i, we have S(C[t,t'\) = 
S(C[t]) = S(C"[t[s]i]). By redundancy of j, S(C"[i[s]j]) = S(C[t[s] t },t']) = 

S(C[i[s] l ,t'[ S '] J ]), i.e., S(CM'D = s(c[t[ s ];,t'[ s 'W)- □ 

Given a syntactic erasure p and a term t G T(J-), we define the maximal non- 
redundant context MNRC p (t) of £ as MNRC p (t) = i[D] pi . p(/l ) ■ • ■ [□] Pii . p(/?i ), where 
pi, . . . , p n are the positions of all outermost subterms rooted by symbols f\ , . . . , /„ 
such that p{fi) ^ for 1 < i < n. 

Proposition^ 

If the syntactic erasure p : T — > "P(N) is sound with respect to the semantics S, 
then for all t, s G T{F), t = Tp s implies that S(t) = S(s). 

Proof 

By induction on the structure of MNRC p {t) and using Lemmata and [3J □ 
Theorem \13\ 

Let 7?. be a left-linear TRS, S be a rewriting semantics for 1Z such that S ^ red^, p 
be a sound syntactic erasure for S, and t, S G T{T P ). If i 5, then Vt', (5' G T(.F) 
such that T p (t') = f and t>(<5') = 5, S(S') C S(i'). 

Proof 

By induction on the length to of the derivation i -^?j p (5. If to = 0, then t = 5, 
and for all t',8' G T(J r ) such that r p (t') = f and t p (6') = 6, by Proposition 

s(0 = s(*'). 

If to > 0, then f ^tz p s 6. Consider t",5" G T{T) such that r p (t") = t 

and T P (S") = 5. First we prove, by induction on p, that there exist t', s' such that 

T p (t') = t, t p (s') = s, and i' ^-> K s'. 
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1. If p = A, then t = a (I) for some I — ► r in 7?. p . Then, by Definition 1141 there 
exists I' —> r' E TZ such that r p (Z') = Z and cri>(T p (r')) — r. Now, there exist a 
term t' G and a substitution a' such that r p (t') = t and t' = <j'(1'). Then, 
by left-linearity, for all x in Var(l'), if x G Var(Z), we have cr(x) — t p (<j'(x)). 

Otherwise, let l'\ q = x, a(x) — t'\ q . Hence, t' —>iz cr'(r'). 

2. If p = i.q, then we consider the terms t = f p (t\, . . . , tk), s = f p (si,...,Sk), 
t' = f(t[, ... ,t' n ) and s' = /(si, . . .,s' n ), such that k = n-\p{f)\. Then,p' = i'.q', 
where i = \{1 < i'} — p(f)\, and for all j,j' s.t. 1 < j < k and 1 < j' < n, 
j = \{1 < j'} — p(f)\, tj = T p (t'j,), Sj — T p (s'j,). By the induction hypothesis, the 
conclusion follows. 

Now, by induction hypothesis, for all w, w' G such that t p (w) = s and 

t p (w') = 5, we have that S(w') C S(w). Thus, since w = Tp s' and w' = Tp S' ', by 
Proposition[7| we have that S(5') C S(s'). By definition of red, redes') C redn(t'). 
Then, let T be the window set such that S ^ red TC , redes') n T C red n (t') n T, 
and thus, S(s') C S(t'). Hence, we obtain that S(<5') C S(i'). But, by Proposition [7| 
S(t") = S(t') and S(S") = S(S'); thus, the conclusion follows. □ 

Theorem \l e A 

Let 1Z be a left-linear TRS, S be a rewriting semantics for 1Z, p be a sound syntactic 
erasure for S, and t G T(J r ). If 5 G S(t), then r p (i) —>\ p t p (S). 

Proof 

Let i' = C[c, . . . , c], where C[D, ...,□] = MNRC p (t) and c e f is the constant 
used in 1Z P . Since t = p t', by Proposition |7| t — >^ £ if and only if t' — ^ 5. Now we 
prove, by induction on the length to of derivation t' — ^ <5 that r p (t') — r p(^)- 
If m = 0, then t' ~ S and the result is immediate. If to > 0, we let t' A TC s — ^ S. 
By induction on p, we prove that either T p (t') -^tz p t p (s) or T p (t') — t p (s). 

1. If p = A, then there exists / — > r in 1Z such that = a(l) and s = u(r). By Propo- 
sition T p (a(l)) = <7 p (t p (1)) and T p (er(r)) = o- p (T p (r)) where ct p (:e) = T p (a(x)) 
for all cc G X. Left-linearity of 1Z ensures that, every variable x that occurs 
within an erasable subterm of / (i.e., a subterm l\ p such that there exists q.i < p 
such that i G p(root(l\ q ))) does not occur in t p (1). Thus, when considering 
x G Var{l) — Var(r p (/)), by definition of t' , it must be a(x) = c. Hence, 
cr p( T p( r )) — a p( a i( T p( r ))) where ai is fixed as in Definition ITol Thus, by def- 
inition of 1Z P , T p (t') -^ Up t p (s). 

2. If p = i.q, then we let t' — f(t[, . . . , t' k ) and s = f{s\, . . . , Sk) and consider two 
cases: 

(a) If i G p(f), then T p (t') = r p (s) since t only differs from s in the i-th 
argument ti of / in t' (which is removed by t p ). 

(b) If i G' p( f), then, the i-th argument of / in t becomes the (transformed) 
j-th argument T p (ti) of / in T p (t'), where j = \{l < i} — p(f)\- By the 
induction hypothesis, either r p (^) ~^n p T p{ s i) or T p(t'i) = T p( s i)- In both 
cases, the conclusion follows. 
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Therefore, we have that either T p {t') — ► t p (s) or T p (t') = t p (s). By the induction 
hypothesis, t p (s) T p{$)- Thus, T p {t') -+7^ t p {5). Since T p (t) = T p (t'), the 

conclusion follows. □ 

Theorem \ 1J\ 

Let TZ be a left-linear TRS, p be a sound syntactic erasure for eval^, t £ T{!F), and 
6 6 T{C). Then, 7>(t) ->* Up 5 iff 5 £ eval w (t). 

Proof 

Immediate from Theorem 1 131 and Theorem II 21 □ 
Theorem ] 15\ 

Let TZ be a left-linear TRS. Let p be a sound syntactic erasure for eval-R.. If TZ is 
eval^-defined and confluent, then the erasure 1Z P of TZ is confluent. 

Proof 

Given t e T(T p ), if ii ^-s— t — ^ *2 with £ x 7^ i 2 > by Theorem El there exist 
s, si, s 2 e such that r p (s) = t, t p (si) — t%, t p {s2) — t 2 , eva\n(si) C evata(s), 

and eva\-jz(s2) C eval?j.(s). 

Since eval-R. is 7£-normalized, and 7?. is confluent and evata-defined, evalrc(s) is 
a singleton consisting of the normal form t'. Moreover, eva\u{si) — eval-7j(s 2 ) — 
eval-R.(s), and by Theorem [T51 t\ ~+tz p T p (t') ^j?*— t 2 . □ 

Theorem \lb\ 

Let 71 be a left-linear and completely defined TRS, and p be a sound syntactic 
erasure for eval-^.. If 7?. is normalizing, then the erasure 7Z P of TZ is normalizing. 

Proof 

Since 7\L is normalizing and completely defined, Vt 6 T{F), 3£ e T(C) £ evalrc(i). 
Then, by Theorem El T p(t) ~^n p t p(3)> an dj by Proposition r p ((5) = 5. Hence, 
the conclusion follows. □ 



Appendix B Benchmarks Code 

We give some example programs which contain redundant arguments, borrowed 
from the literature and/or obtained by applying common transformation processes. 
For each example, we show the final program which results from optimizing the 
program by using our automatic redundant argument removal prototype. Programs 
are given in the (currified) functional programming syntax used by Curry. 
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Program bogus 

The following program bogus is borrowed from ( |Kobayashi 2000 W and and Siveroni 1 999). 
where it is introduced for useless variable elimination (UVE), a popular technique 
for removing dead variables, 
data Nat = Z I S Nat 

loop : : Nat -> Nat -> Nat -> Nat 

loop a bogus Z = loop (S a) (S bogus) (S Z) 

loop a bogus (S x) = a 

The second argument of loop is signaled as redundant and then removed. 

loop' :: Nat -> Nat -> Nat 

loop' a Z = loop' (S a) (S Z) 

loop' a (S x) = a 

Program applast 

The following program is borrowed from IjLeuschel 1998f) and is obtained by pro- 
gram specialization ( |Alpuente et al. 1998). 
data Nat = I S Nat 

append: : [Nat] -> [Nat] -> [Nat] last :: [Nat] -> Nat 

append nil y = y last (x:nil) = x 

append (x:xs) y = x: (append xs y) last (x:y:ys) = last (y:ys) 

The specialization of the program applast for goal 'last (append xs (x:nil))' 
yields: 

applast :: [Nat] -> Nat -> Nat lastnew::Nat -> [Nat] -> Nat -> Nat 

applast nil z = z lastnew x nil z = z 

applast (x:xs) z = lastnew x xs z lastnew x (y:ys) z = lastnew y ys z 

The first argument of applast and the first and second arguments of lastnew arc 

identified as redundant and removed. 

applast' :: Nat -> Nat lastnew' :: Nat -> Nat 

applast' z = z lastnew' z = z 

Program plus_minus 

This example is borrowed from IjLeuschel 1998(1 and is obtained by program spe- 
cialization ( |Alpuente et al. 1 998). 

data Nat = Z I S Nat 

plus : : Nat -> Nat -> Nat minus : : Nat -> Nat -> Nat 

plus Z x = x minus x Z = x 

plus (S x) y = S (plus x y) minus (S x) (S y) = minus x y 

The specialization for goal 'minus (plus x y) x' yields: 

minus_pe : : Nat -> Nat -> Nat 
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minus_pe Z y = y 

minus_pe (S x) y = minus_pe x y 

The first argument of minus_pe is identified as redundant and removed. 

mimis_pe ' : : Nat -> Nat 
minus_pe ' y = y 

Program plus_leq 

This example is borrowed from IjLeuschel 1998(1 and is obtained by program spe- 
cialization ( jAlpuente et al. 1 998). 

data Nat = Z I S Nat 

plus : : Nat -> Nat -> Nat leq : : Nat -> Nat -> Bool 

plus Z x = x leq Z x True 

plus (S x) y = S (plus x y) leq (S x) Z = False 

leq (S x) (S y) = leq x y 

The specialization for goal 'leq x (plus x y)' yields: 

leq_pe : : Nat -> Nat -> Bool 
leq_pe Z x = True 
leq_pe (S x) y = leq_pe x y 

Both arguments of leq_pe are identified as redundant and removed. 

leq_pe ' : : Bool 
leq_pe ' = True 

Program double_even 

This example is borrowed from IjLeuschel 1998(1 and is obtained by program spe- 
cialization ( |Alpuente et al. 1 998). 

data Nat = Z I S Nat 

double : : Nat -> Nat even : : Nat -> Bool 

double Z = Z even Z = True 

double (S x) = S (S (double x)) even (S Z) = False 

even (S (S x)) = even x 

The specialization for goal 'even (double x)' yields: 

even_pe : : Nat -> Bool 

even_pe Z = True 

even_pe (S x) = even_pe x 

The argument of even_pe is identified as redundant and removed. 

even_pe ' : : Bool 
even_pe ' = True 
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Program sum_allzeros 

This example is borrowed from IjLeuschel 1998(1 and is obtained by program spe- 
cialization ( |Alpuente et al. 1 998). 

data Nat = Z I S Nat 

plus : : Nat -> Nat -> Nat sum : : [Nat] -> Nat 

plus Z x = x sum nil = Z 

plus (S x) y = S (plus x y) sum (x:xs) = plus x (sum xs) 

allzeros : : [Nat] -> [Nat] 

allzeros nil = nil 

allzeros (x:xs) = Z: (allzeros xs) 

The specialization for goal 'sum (allzeros x)' yields: 

sum_pe : : [Nat] -> Nat 

sum_pe nil = Z 

sum_pe (x:xs) = sum_pe xs 

The argument of sum_pe is identified as redundant and removed. 

sum_pe ' : : Nat 
sum_pe ' = Z 

Mutual Recursion 1 

This program is taken from Example 2.23 of IjArts and Giesl 2001j) . 

data Nat = Z I S Nat 
f : : Nat -> Nat -> Nat 
f Z y = Z 
f (S x) y = f (f x y) y 

Both arguments of f are identified as redundant and removed, 
f ' : : Nat 
f = Z 

Mutual Recursion 2 

This program is taken from Example 2.24 of l|Arts and Giesl 2 001). 

data Nat = Z I S Nat 

f : : Nat -> Nat 

f Z = S Z 

f (S Z) = S Z 

f (S (S x)) = f (f (S x)) 

The argument of f is identified as redundant and removed. 

f ' : : Nat 
f> = S Z 



